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DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. § 371 


ATTORNEY'S DOCKET NUMBER 

4239-60771 


U S APPLICATION NO OtVnmm^ »C E«. § 1 5) 


IN 1 bRNATlONAL APPLICATION NO. 

PCT/USOO/06946 


INTERNATIONAL FILING DATE 
16 March 2000 


PRIORITY DATE CLAIMED 
16 March 1999 


1 llLh OF INVENTION • 

A NOVEL CHIMERIC PROTEIN FOR PREVENTION AND TREATMFNT OF HTV INFFrTTOM 


APPLICANT(S) FOR D«ag»tJS ' ~ 

Edward A. Berger, Christie M. Del Castillo 



1. 13 This is a FIRST submission of items concerning a filing under 35 U.S.C. § 371 . 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. § 371 . 

3. □ This is an express request to begin national examination procedures (35 U.S.C. § 371(f) at any time 

rather than delay examination until the expiration of the applicable time limit set in 35 U S C 
§ 371(b) and PCT Articles 22 and 39(1). 

4. 12 A proper Demand for International Preliminary Examination was made by the 19* month from the earliest claimed 

priority date. 

5. EI A copy of the International Application as filed (35 U.S.C. § 37 1(c)(2)) 

a. □ is transmitted herewith (required only if not transmitted by the International Bureau). 

b. □ has been transmitted by the International Bureau. 

c. M is not required, as the application was filed in the United States Receiving Office (RO/US). 

6. □ A translation of the International Application into English (35 U S C. § 371(c)(2)). 

7. S Amendments to the claims of the Internationa! Application under PCT Article 19 (35 U.S C. § 371 (c)(3)) 

a □ are transmitted herewith (required only if not transmitted by the International Bureau) 
b □ have been transmitted by the International Bureau. 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

d, S have not been made and will not be made. 

8. □ A translation of die amendments to the claims under PCT Article 19 (35 U.S.C. § 371(c)(3)). 

9. H An oath or declaration oftheinventor(s) (35 US.C §371 (c)(4)). 

10. □ A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 U.S.C 

§ 371(c)(5)) 

Items 11. to 16. below concern document(s) or Information included: 

1 1. H An Information Disclosure Statement under 37 C F.R. §§ 1.97 and 1 98, PT01449, copies of references cited (56). 

12. S An assignmem documem for recording. A separate cover sheet m compliance with 37 C F R S8 3 28 and 3 3 1 and the 

Recordal fee of S40.00 is included. 

13. 13 A FIRST preliminary amendment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. □ A substitute specification. 

15 . □ A change ofpowerofattomey and/or address letter 

""az:::... iiiiiiii 

H Computer-readable form. 241 97 

13 Statement in Compliance. 
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''W^FFW^^F^W^ I INTERNATIONAL APPLIC/S 

U^/ y / UZ I PCT/USOO/06946 



APPLICATION NO 



I ATTORNEYS DOCKET NUMBEir^ feaUU f 

4239-60771 



17. 1^ The following fees ai 
BASIC NATIONAL FEE (37 C.F.R. §§ L492(a)(l)-(S)): 

Neither International Preliminary Examination fee (37 C.F.R. § 1.482) 
nor International Search fee (37 C.F.R. § 1.445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO. 



International Preliminary Examination fee (37 C.F.R. § 1.482) not paid to 
USPTO but International Search Report prepared by the EPO or JPQ 



...$1,000.00 
S860.00 



International Preliminary Examination fee (37 C.F.R. § 1 .482) not paid to USPTO 

but International Search fee (37 C.F.R. § 1.445(a)(2)) paid to USPTO as an 

International Searching Authority S710.00 

International Preliminary Examination fee paid to USPTO (37 C.F.R. § 1.482) 

but all claims did not satisfy provisions of PCT Article 33(l)-(4) S690.00 

International Preliminary Examination fee paid to USPTO (37 C.F.R. § 1.482) 

and all claims satisfied provisions of PCT Article 33(l)-(4) $100.00 



ENTER APPROPRIATE BASIC FEE AMOUNT = 



Surcharge ofSBO.OO for furnishing the oath or declaration later than □ 20 □ 30 
months ftom the earliest claimed priority date (37 C.F.R. § 1.492(e)). 



Independent Claims 



NUMBER FILED 



3 -3 = 



MULTIPLE DEPENDENT CLAIM(S) (if app 



NUMBER EXTRA 



icable) 



TOTAL OF ABOVE CALCULATIONS 



□ Reduction of 1/2 for filing by small entity. Small entity status is claimed for this application. 



Proces 'ing fee of $130.00 for furnishing the English translation later than □ 20 □ 30 

Months from the earliest claimed priority date (37 C.F.R. §§ 1 .492(f)). 



TOTAL NATIONAL FEE = 



1-ee for recording the enclosed assignment (37 C.F.R. § I.2i(h)). The assignment must be 
accomp anied by an appropriate cover sheet (37 C.F.R. §§ 3.28, 3.31). S40.00 per property. 



TOTAL FEES ENCLOSED = 



CALCULATIONS (pto use only) 



1,490.00 



b. □ Please charge my Deposit Account No. 
is enclosed. 



of S 1,530.00 to cover the above fees is enclosed. 

the amount of S to cover the above fees. A duplicate copy of this sheet 



The Director is hereby authorized to charge any additional fees that may be required, or credit any overpayment, to Deposit 
Account No 024550. A duplicate copy of this sheet is enclosed. ^ ^ y 



I. H Please re 



the enclosed postcard to confirm that the items listed above have been received. 

■Jpietitioiu 



NOTE: Where an appropriate time limit under 37 C.F.R. § 1.494 or § 1.495 has not beei 
or (b)) must be filed and granted to restore the application to pending statu; 

SEND ALL CORRESPONDENCE TO: 



KLARQUIST SPARKMAN. LLP 
One Worid Trade Center, Suite 1600 
121 S.W. Salmon Street 
Portland. OR 97204-2988 




cc: Docketing 




IN THE UNITED STATES PATENT AND TRADEMARK OFFICE ^ ^ ^^^^ 




In re Application of: Berger and Del Castillo 



Art Unit: Not yet assigned 



Application No. Not yet assigned 
Filed: Herwith 

For: A NOVEL CHIMERIC PROTEIN FOR 

PREVENTION AND TREATMENT OF HIV 
INFECTION 

Examiner: Not yet assigned 

Date: September 13, 2001 
BOX PCT 

COMMISSIONER FOR PATENTS 
WASHINGTON, D.C. 20231 



In the Specification, please insert the following header and paragraph on page 1, immediately 
following the title: 

-CROSS REFERENCE TO RELATED APPLICATIONS 
This is the National Stage of International Application No. PCT/USOO/06946, filed 
March 16, 2000, and claims the benefit of U.S. Provisional Application No. 60/124,681, filed 
March 16, 1999. The provisional application is incorporated herein in its entirety.-- 

By this amendment the specification has been changed to reflect prior related 
applications. No new matter is added by this amendment. 



One World Trade Center, Suite 1600 
121 S.W. Salmon Street 
Portland, Oregon 97204 
Telephone: (503) 226-7391 
Facsimile: (503) 228-9446 



PRELIMINARY AMENDMENT 




Respectfully submitted, 
KLARQUIST SPARK^ 
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Attorney's Matter No. 4239-60771 Date of Deposit: September 13, 2001 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of: Berger and Del Castillo Art Unit: Not yet assigned 

Application No. Not yet assigned 
Filed: Herwith 

For: A NOVEL CHIMERIC PROTEIN FOR 
PREVENTION AND TREATMENT OF 
HIV INFECTION 

Examiner: Not yet assigned 
Date: September 13, 2001 



STATEMENT IN COMPLIANCE WITH 37 C.F.R. § 1.821(f) 

BOX PCT 

COMMISSIONER FOR PATENTS 
Washington, DC 20231 

Sir: 

In compliance with 37 C.F.R. § 1 .821(f), the undersigned declares that the nucleotide 
and/or amino acid sequences presented in the paper copy of the "Sequence Listing" submitted 
herewith are the same as the sequences contained in the computer-readable form of the 
"Sequence Listing." 



Respectfully submitted, 
KLARQUIST SPARKMAN, LLP 



^•-^ Tai^a yi. Harding, Ph.D. 
/Kedstration No. 42,630 

One World Trade Center, Suite 1 600 
121 S.W. Salmon Street 
Portland, Oregon 97204 
Telephone: (503) 226-7391 
Facsimile: (503) 228-9446 
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SEQUENCE LISTING 
<110> Berger, et al. 

<120> NOVEL CHIMERIC PROTEIN FOR PREVENTION AND TREATMENT OF 
HIV INFECTION 

<130> 4239-60771 

<140> 
<141> 

<150> PCT/USOO/06946 
<151> 2000-03-16 

<150> 60/124,681 
<151> 1999-03-16 



09/936702 
531BeCdPCT,r- 13 SEP 2001 



<170> PatentIn Ver. 2.1 



<210> 1 
<211> 5 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: linker peptide 

<4oo> r 

Gly Gly Gly Gly Ser 
1 5 



<210> 2 
<211> 35 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: seven repeat 
polypeptide linker 

<400> 2 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
15 10 15 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 

Gly Gly Ser 
35 



<210> 3 
<211> 508 



1 



<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CD4-scFv (17b) 



Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 
15 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 40 45 

He Gin Phe His Trp Lys Asn Ser Asn Gin He Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65 70 75 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu He He 
85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 110 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 
180 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Glv 
210 215 220 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Glv 
225 230 235 240 

Gly Gly Ser Gin Val Gin Leu Leu Glu Ser Gly Ala Glu Val Lys Lys 
245 250 255 

Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala Ser Gly Asp Thr Phe 
260 265 270 



2 



He Arg Tyr Ser Phe Thr Trp Val Arg Gin Ala Pro Gly Gin Gly Leu 
275 280 285 

Glu Trp Met Gly Arg He He Thr He Leu Asp Val Ala His Tyr Ala 
290 295 300 

Pro His Leu Gin Gly Arg Val Thr He Thr Ala Asp Lys Ser Thr Ser 
305 310 315 320 

Thr Val Tyr Leu Glu Leu Arg Asn Leu Arg Ser Asp Asp Thr Ala Val 
325 330 335 

Tyr Phe Cys Ala Gly Val Tyr Glu Gly Glu Ala Asp Glu Gly Glu Tyr 
340 345 350 

Asp Asn Asn Gly Phe Leu Lys His Trp Gly Gin Gly Thr Leu Val Thr 
355 360 365 

Val Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
370 375 380 



Gly Ser Glu Leu Glu Leu Thr Gin Ser Pro Ala Thr Leu Ser Val Ser 
385 390 395 400 

Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Glu Ser Val Ser 
405 410 415 ^ 

Ser Asp Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu 
420 425 430 

Leu He Tyr Gly Ala Ser Thr Arg Ala Thr Gly Val Pro Ala Arg Phe 
435 440 445 

Ser Gly Ser Gly Ser Gly Ala Glu Phe Thr Leu Thr He Ser Ser Leu 
450 455 460 

Gin Ser Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Asn Asn Trp 
465 470 475 480 

Pro Pro Arg Tyr Thr Phe Gly Gin Gly Thr Arg Leu Glu He Lys Leu 
485 490 495 

Val Pro Arg Gly Ser Gly His His His His His His 
500 505 



<210> 4 
<211> 1440 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CD4-scFv (17b) 
<400> 4 

atgaaccggg gagtcccttt taggcacttg cttctggtgc tgcaactggc gctcctccca 60 



3 



gcagccactc agggaaagaa agtggtgctg 
tgtacagctt cccagaagaa gagcatacaa 
attctgggaa atcagggctc cttcttaact 
gactcaagaa gaagcctttg ggaccaagga 
atagaagact cagatactta catctgtgaa 
ctagtgttcg gattgactgc caactctgac 
ctgaccttgg agagcccccc tggtagtagc 
aaaaacatac agggggggaa gaccctctcc 
acctggacat gcactgtctt gcagaaccag 
gtgctagctt tccagaaggc ctccggaggt 
ggtggatccg gtggcggagg gtcgggcggg 
ggaggttcac aggtgcagct gctcgagtct 
gtgaaggtct cctgcaaggc ctctggagac 
cgacaggccc ctggacaagg ccttgagtgg 
gcacactacg caccgcacct ccagggcaga 
acagtctacc tggagctgcg gaatctaaga 
ggagtgtacg agggagaggc ggacgagggg 
tggggccagg gaaccctggt cacggtcacc 
agcggtggcg gcggatctga actcgagttg 
ccaggggaaa gagccaccct ctcctgcagg 
tggtaccagc agaaacctgg ccaggctccc 
gccaccggtg tcccagccag gttcagtggc 
atcagcagcc tgcagtctga agattttgca 



ggcaaaaaag gggatacagt ggaactgacc 120 
ttccactgga aaaactccaa ccagataaag 180 
aaaggtccat ccaagctgaa tgatcgcgct 240 
aacttccccc tgatcatcaa gaatcttaag 300 
gtggaggacc agaaggagga ggtgcaattg 360 
acccacctgc ttcaggggca gagcctgacc 420 
ccctcagtgc aatgtaggag tccaaggggt 4 80 
gtgtctcagc tggagctcca ggatagtggc 540 
aagaaggtgg agttcaaaat agacatcgtg 600 
ggcggtagtg ggggaggcgg ttcaggcgga 660 
ggtggaagcg ggggtggcgg ctccggaggc 720 
ggggctgagg tgaagaagcc tgggtcctcg 780 
accttcatca gatatagttt tacctgggtg 840 
atgggaagga tcatcactat ccttgatgta 900 
gtcacgatta ccgcggacaa gtccacgagc 960 
tctgacgata cggccgtata tttctgtgcg 1020 
gaatatgata ataatgggtt tctgaaacat 1080 
tcaggtggcg gtggctccgg aggtggtggg 1140 
acgcagtctc cagccaccct gtctgtgtct 1200 
gccagtgaga gtgttagtag cgacttagcc 12 60 
aggctcctca tatatggtgc atccaccagg 1320 
agtgggtctg gggcagaatt cactctcacc 1380 
gtttattact gtcagcagta caataactgg 1440 



<210> 5 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
oligonucleotide 

<400> 5 

cctccggagg tggcggtagt gggggaggcg gttcaggcgg aggtggatcc ggaggcggag 60 
ggtcgggcgg gggtggaagc gggggtggcg gctctggtgg cggaggtacc actagttaag 120 
tgagtag -^27 



<210> 6 
<211> 39 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
encoded by SEQ ID NO: 5 

<400> 6 

Ala Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
15 10 15 

Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
20 25 30 

Gly Gly Gly Gly Thr Thr Ser 



4 



35 



<210> 7 
<211> 103 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 7 

ttttatggat ccggtggcgg agggtcgggc gggggtggaa gcgggggtgg cggctccgga 60 
ggcggaggtt cacaggtgca gctgctcgag tctggggctg agg 103 



<210> 8 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
encoded for by SEQ ID NO: 7 

<400> 8 

Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
15 10 15 

Ser Gly Gly Gly Gly Ser Gin Val Gin Leu Leu Glu Ser Gly Ala Glu 
20 25 30 



<210> 9 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

taatttatcg atcacgtgac tagtcctagg cccgggtcaa tgatgatgat gatgatggcc 60 
gctgc 65 



<210> 10 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
encoded for by SEQ ID NO: 9 



5 



<400> 10 

Ser Gly His His His His His His 
1 5 



<210> 11 
<211> 131 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse 
oligonucleotide 

<400> 11 

ctagctactc acttaactag tggtacctcc gccacctgag ccgccacccc cgcttccacc 60 

ccccgcccga ccctccgcct ccggatccac ctccgcctga accgcctccc cactaccgcc 120 
acctccggag g 131 



6 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/936,702 TIME: 14:41:19 

Input Set : A:\60771 seq.txt 

Output Set: N:\CRF3\I0042001\I936702.raw 

3 <110> APPLICANT: Berger, et al. 

5 <120> TITLE OF INVENTION: NOVEL CHIMERIC PROTEIN FOR PREVENTION AND TREATMENT OF 

6 HIV INFECTION 

8 <130> FILE REFERENCE: 4239-60771 
C--> 10 <I40> CURRENT APPLICATION NUMBER: US/09/936,702 
C--> 11 <141> CURRENT FILING DATE: 2001-09-13 

13 <150> PRIOR APPLICATION NUMBER: PCT/USOO/06946 

14 <151> PRIOR FILING DATE: 2000-03-16 

16 <150> PRIOR APPLICATION NUMBER: 60/124,681 C ^ i T ^ O F™" r\ 

17 <151> PRIOR FILING DATE: 1999-03-16 t IM f C 11 t U 
19 <160> NUMBER OF SEQ ID NOS : 11 

21 <170> SOFTWARE: Patentin Ver. 2.1 
[;123 <210> SEQ ID NO: 1 
0.A <211> LENGTH: 5 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
;1;;^8 <220> FEATURE: 

Jp9 <223> OTHER INFORMATION: Description of Artificial Sequence: linker peptide 
^;=31 <400> SEQUENCE: 1 ^ 
CI32 Gly Gly Gly Gly Ser 

m3 1 5 

£36 <210> SEQ ID NO: 2 
£p7 <211> LENGTH: 35 
,lf8 <212> TYPE: PRT 

rjg <213> ORGANISM: Artificial Sequence 
; 41 <220> FEATURE: 

^2 <223> OTHER INFORMATION: Description of Artificial Sequence: seven repeat 
fe43 polypeptide linker 

hi5 <400> SEQUENCE: 2 

46 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 

47 1 5 10 15 
4 9 Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
50 20 25 30 

52 Gly Gly Ser 

53 35 

56 <210> SEQ ID NO: 3 

57 <211> LENGTH: 508 

58 <212> TYPE: PRT 

59 <213> ORGANISM: Artificial Sequence 

61 <220> FEATURE: y 

62 <223> OTHER INFORMATION: Description of Artificial Sequence: CD4 -scFvr 17b 

64 <400> SEQUENCE: 3 V '^r 

65 Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

66 1 5 10 15 

68 Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 

69 20 25 30 
71 Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/936,702 TIME: 14:41:19 

Input Set : A:\60771 seq.txt 

Output Set: N:\CIlF3\10042001\I936702.raw 

72 35 40 45 

74 He Gin Phe His Trp Lys Asn Ser Asn Gin He Lys He Leu Gly Asn 

75 50 55 60 

77 Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 

78 65 70 75 80 

80 Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu He He 

81 85 90 95 

83 Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 

84 100 105 110 

86 Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 

87 115 120 125 

89 Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 

90 130 135 140 

92 Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 

93 145 150 155 160 
95 Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 

,™96 165 170 175 

l~:98 Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 

---:99 180 185 190 

.:|101 val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 

i ,-102 195 200 205 

.-■a04 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 

•-.f05 210 215 220 

.-107 Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 

T%08 225 230 235 240 

' 4l0 Gly Gly Ser Gin Val Gin Leu Leu Glu Ser Gly Ala Glu Val Lys Lys 

^111 245 250 255 

' -X13 Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala Ser Gly Asp Thr Phe 

■:>14 260 265 _ 270 

Nai6 He Arg Tyr Ser Phe Thr Trp Val Arg Gin Ala Pro Gly Gin Gly Leu 

Uil7 275 280 285 

^'0.19 Glu Trp Met Gly Arg He He Thr He Leu Asp Val Ala His Tyr Ala 

,120 290 295 300 

■ 122 Pro His Leu Gin Gly Arg Val Thr He Thr Ala Asp Lys Ser Thr Ser 
123 305 310 315 320 

125 Thr Val Tyr Leu Glu Leu Arg Asn Leu Arg Ser Asp Asp Thr Ala Val 

126 325 330 335 

128 Tyr Phe Cys Ala Gly Val Tyr Glu Gly Glu Ala Asp Glu Gly Glu Tyr 

129 340 345 350 

131 Asp Asn Asn Gly Phe Leu Lys His Trp Gly Gin Gly Thr Leu Val Thr 

132 355 360 365 

134 Val Thr Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 

135 370 375 380 

137 Gly Ser Glu Leu Glu Leu Thr Gin Ser Pro Ala Thr Leu Ser Val Ser 

138 385 390 395 400 

140 Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Glu Ser Val Ser 

141 405 410 415 

143 Ser Asp Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu 

144 420 425 430 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/936,702 TIME: 14:41:19 

Input Set : A:\6077I seq.txt 

Output Set: N:\CRF3\10042001\I936702.raw 

146 Leu He Tyr Gly Ala Ser Thr Arg Ala Thr Gly Val Pro Ala Arg Phe 

147 435 440 445 

149 Ser Gly Ser Gly Ser Gly Ala Glu Phe Thr Leu Thr He Ser Ser Leu 

150 450 455 460 

152 Gin Ser Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Asn Asn Trp 

153 465 470 475 480 

155 Pro Pro Arg Tyr Thr Phe Gly Gin Gly Thr Arg Leu Glu He Lys Leu 

156 485 490 495 
15 8 Val Pro Arg Gly Ser Gly His His His His His His 

159 500 505 

162 <210> SEQ ID NO: 4 

163 <211> LENGTH: 1440 

164 <212> TYPE: DNA 

165 <213> ORGANISM: Artificial Sequence 

167 <220> FEATURE: 

168 <223> OTHER INFORMATION: Description of Artificial Sequence: CD4-scFv(17b) 
__170 <400> SEQUENCE: 4 

^71 atgaaccggg gagtcccttt taggcacttg cttctggtgc tgcaactggc gctcctccca 60 
m.72 gcagccactc agggaaagaa agtggtgctg ggcaaaaaag gggatacagt ggaactgacc 120 
iiJl73 tgtacagctt cccagaagaa gagcatacaa ttccactgga aaaactccaa ccagataaag 180 
ij.J174 attctgggaa atcagggctc cttcttaact aaaggtccat ccaagctgaa tgatcgcgct 240 
qU75 gactcaagaa gaagcctttg ggaccaagga aacttccccc tgatcatcaa gaatcttaag 300 
^.,|176 atagaagact cagatactta catctgtgaa gtggaggacc agaaggagga ggtgcaattg 3 60 
=l|177 ctagtgttcg gattgactgc caactctgac acccacctgc ttcaggggca gagcctgacc 420 
|'J178 ctgaccttgg agagcccccc tggtagtagc ccctcagtgc aatgtaggag tccaaggggt 4 80 
'"179 aaaaacatac agggggggaa gaccctctcc gtgtctcagc tggagctcca ggatagtggc 540 
acctggacat gcactgtctt gcagaaccag aagaaggtgg agttcaaaat agacatcgtg 600 
fejIIBl gtgctagctt tccagaaggc ctccggaggt ggcggtagtg ggggaggcgg ttcaggcgga 660 
fe|182 ggtggatccg gtggcggagg gtcgggcggg ggtggaagcg ggggtggcgg ctccggaggc 720 
l^^^a.83 ggaggttcac aggtgcagct gctcgagtct ggggctgagg tgaagaagcc tgggtcctcg 780 
|i,f84 gtgaaggtct cctgcaaggc ctctggagac accttcatca gatatagttt tacctgggtg 840 
rp5 cgacaggccc ctggacaagg ccttgagtgg atgggaagga tcatcactat ccttgatgta 900 
|^;186 gcacactacg caccgcacct ccagggcaga gtcacgatta ccgcggacaa gtccacgagc 960 
' 187 acagtctacc tggagctgcg gaatctaaga tctgacgata cggccgtata tttctgtgcg 1020 

188 ggagtgtacg agggagaggc ggacgagggg gaatatgata ataatgggtt tctgaaacat 1080 

189 tggggccagg gaaccctggt cacggtcacc tcaggtggcg gtggctccgg aggtggtggg 1140 

190 agcggtggcg gcggatctga actcgagttg acgcagtctc cagccaccct gtctgtgtct 1200 

191 ccaggggaaa gagccaccct ctcctgcagg gccagtgaga gtgttagtag cgacttagcc 1260 

192 tggtaccagc agaaacctgg ccaggctccc aggctcctca tatatggtgc atccaccagg 1320 

193 gccaccggtg tcccagccag gttcagtggc agtgggtctg gggcagaatt cactctcacc 1380 

194 atcagcagcc tgcagtctga agattttgca gtttattact gtcagcagta caataactgg 1440 

197 <210> SEQ ID NO: 5 

198 <211> LENGTH: 127 

199 <212> TYPE: DNA 

200 <213> ORGANISM: Artificial Sequence 

202 <220> FEATURE: 

203 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic ^ 

204 oligonucleotide 
206 <400> SEQUENCE: 5 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/936,702 TIME: 14:41:19 

Input Set : A:\60771 seq.txt 

Output Set: N:\CRF3\10042001\I936702.raw 

207 cctccggagg tggcggtagt gggggaggcg gttcaggcgg aggtggatcc ggaggcggag 60 

208 ggtcgggcgg gggtggaagc gggggtggcg gctctggtgg cggaggtacc actagttaag 120 

209 tgagtag 227 

212 <210> SEQ ID NO: 6 

213 <211> LENGTH: 39 

214 <212> TYPE: PRT 

215 <213> ORGANISM: Artificial Sequence 

217 <220> FEATURE: 

218 <22 3> OTHER INFORMATION: Description of Artificial Sequence: peptide 

219 encoded by SEQ ID NO: 5 

221 <400> SEQUENCE: 6 

222 Ala Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 

223 15 10 15 

22 5 Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
226 20 25 30 

228 Gly Gly Gly Gly Thr Thr Ser 
35 

P232 <210> SEQ ID NO : 7 
y;l233 <211> LENGTH: 103 
y;|234 <212> TYPE: DNA 

|iJ235 <213> ORGANISM: Artificial Sequence 
fi^|237 <220> FEATURE: 

ri238 <223> OTHER INFORMATION: Description of Artificial Sequence- primer 
P=p40 <400> SEQUENCE: 7 

p5241 ttttatggat ccggtggcgg agggtcgggc gggggtggaa gcgggggtgg cggctccgga 60 
•■■Q42 ggcggaggtt cacaggtgca gctgctcgag tctggggctg agg 103 
p 245 <210> SEQ ID NO : 8 
£124 6 <211> LENGTH: 32 
i|!i247 <212> TYPE: PRT 

|,ii24 8 <213> ORGANISM: Artificial Sequence 
lJ50 <220> FEATURE: 

^J51 <223> OTHER INFORMATION: Description of Artificial Sequence: peptide 
y?52 encoded for by SEQ ID NO: 7 

^■'254 <400> SEQUENCE: 8 

255 Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 

256 15 10 15 

258 Ser Gly Gly Gly Gly Ser Gin Val Gin Leu Leu Glu Ser Gly Ala Glu 

259 20 25 30 

265 <210> SEQ ID NO: 9 

266 <211> LENGTH: 65 

267 <212> TYPE: DNA 

258 <213> ORGANISM: Artificial Sequence 

270 <220> FEATURE: 

271 <223> OTHER INFORMATION: Description of Artificial Sequence: primer ^ 

273 <400> SEQUENCE: 9 

274 taatttatcg atcacgtgac tagtcctagg cccgggtcaa tgatgatgat gatgatggcc 60 

275 gctgc 

278 <210> SEQ ID NO: 10 

279 <211> LENGTH: 8 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/936,702 TIME: 14:41:19 

Input Set : A:\60771 seq.txt 

Output Set: N:\CRF3\10042001\I936702.raw 

280 <212> TYPE: PRT 

281 <213> ORGANISM: Artificial Sequence 

283 <220> FEATURE: 

284 <223> OTHER INFORMATION: Description of Artificial Sequence: peptide^ 

285 encoded for by SEQ ID NO: 9 

287 <400> SEQUENCE: 10 

288 Ser Gly His His His His His His 

289 1 5 

292 <210> SEQ ID NO: 11 

293 <211> LENGTH: 131 

294 <212> TYPE: DNA 

2 95 <213> ORGANISM: Artificial Sequence 
297 <220> FEATURE: 

2 98 <223> OTHER INFORMATION: Description of Artificial Sequence: reverse 

2 99 oligonucleotide 

3 01 <400> SEQUENCE: 11 

.. 302 ctagctactc acttaactag tggtacctcc gccacctgag ccgccacccc cgcttccacc 60 
■303 ccccgcccga ccctccgcct ccggatccac ctccgcctga accgcctccc cactaccgcc 120 
-.3 04 acctccggag g 23i 
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A NOVEL CHIMERIC PROTEIN FOR PREVENTION AND 
TREATMENT OF HIV INFECTION "^^""^ 



FIELD OF THE INVENTION 

This invention relates to proteins useful in the prevention and treatment of human 
immunodeficiency vims (HIV) infection. More specifically, it relates to fusion proteins that bind to 
two sites on a single target protein, especially when one binding domain of the fusion protein binds 
to an induced site (on the target protein) that is exposed by the binding of the other binding domain 
of the fiision protein. 



BACKGROUND OF THE INVENTION 

Acquired immune deficiency syndrome (AIDS) is a fatal disease of growing prevalence in 
the modem world. The agent responsible for this disease, human immunodeficiency virus (HIV), 
was first identified in 1983. HIV is a T-lymphotropic retrovirus that invades and replicates in cells of 
the immune system, primarily helper T-lymphocytes. The consequent dysfimction in T-lymphocyte- 
mediated immimity results in an immuno-compromised condition. Patients usually die of associated 
opportunistic viral, bacterial or fungal infections. A characteristics laboratory finding in AIDS is the 
decrease in helper T lymphocytes (CD4), and particularly a steady decrease in the ratio of CD4 to 
suppressor T lymphocytes CDS as the disease progresses. Virus binding is primarily mediated by 
interaction of gpl20, the extemal subunit of the HIV envelope glycoprotein (Env) with CD4 protein 
and various coreceptor molecules (one of several alternative chemokine receptors). These 
interactions then activate the gp41 transmembrane subunit of the envelope glycoprotein, to cause 
fiision between the virus and cell membranes. See Retroviruses, Coffin et al. (eds.) (1997) CSHP, 
New York, Ch. li. 

The himioral immune system is triggered by HIV infection, though it generally does not 
provide sufficient protection to ward off the infection. Env is the major target of anti-HIV 
neutralizing antibodies (Wyatt et al. Nature 393:705-71 1, 1998). However, Env has evolved so that 
its relatively invariant neutralizing determinants are protected from the humoral immime system. 
Antibodies to these regions therefore are generated at a low frequency and their neutralizing activities 
in vivo are generally weak. Certain variable regions {e.g., the V3 loop) are targets for potent 
neutralizing antibodies, but these are typically restricted to a limited number of HIV-strains (in other 
words, they are not broadly cross-reactive). For a list of several gpl20 antigenic epitopes and 
consensus definitions of the conserved and variable regions of gpl20, see published PCT application 
PCT/US98/02766 (pubhcation number WO 98/36087) and Coffin et al. (eds.) (1997) CSHP, New 
York, Ch. 12. 

A neutralizing monoclonal antibody (MAb) with potent and broadly cross-reactive activity 
would have great potential value in protocols aimed at preventing HIV infection before or 
immediately after exposure, for example in neonatal transmission, post-exposure prophylaxis, and as 
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a topical inhibitor. Such a MAb may also be useful in treating chronic infection (D'Souza et al J. 
Infect. Dis. 175:1056-1062, 1997). However only a handful of MAbs with the desired broadly cross- 
reactive neutralizing activities have been described. Because of limited potency and cross-reactivity 
of these molecules, even the three most promismg candidates have questionable clinical value 
5 (D'Souza era/., 1997). 

Extensive efforts are underway to provide immunological or pharmacological approaches to 
controlling HIV infection (Coffin etal, 1997, Ch. 12). The specific interaction between gpl20 and 
CD4 has been exploited in efforts to provide a possible treatment for HIV infection. See, e.g., U.S. 
Patent No. 5,817,767; Capon et al. Nature 337:525-531, 1989. A soluble fragment of CD4 (sCD4), 

10 comprising the first and second domains of this protein (D1D2) has been generated, and this 

molecule interacts specifically with gpl20, essentially serving as a molecular decoy. sCD4 has been 
shown to block the spread of HIV between cultured cells (Moore et at.. Science 250:1 139-1 142, 
1990). However, clinical trials with sCD4 were inconclusive as to the effects on human viral load 
(Schooley et al, Ann. Internal Med. 112:247-253, 1990; K.ahaet al,Ann. Internal Med. 112:254- 
: 15 261, 1990). Subsequent studies indicated that, unlike laboratory-adapted HIV strains, isolates 

obtained directly from infected patients (primary isolates) are resistant to neutralization by sCD4 
(Darr et al, Proc. Natl Acad. Sci. 87:6574-6578, 1990), 

In another approach, researchers have generated an antibody-like molecule by fusing the 
binding portion of CD4 to the constant region (Fc) of a human IgG heavy chain (see, e.g.. Capon et 

20 al. Nature 337:525-531, 1989; and Bym et al.. Nature 344:667-670, 1990). This molecule, termed 

CD4 immunoadhesin, exploits the native functions of immunoglobulin Fc, such as its ability to fix 
complement, its ability to mediate antibody-dependent cytotoxicity, and its transfer across the 
placental barrier. There are significant drawbacks to using Fc receptors in association with CD4, 
because such a constmct may be responsible for targeting HIV to Fc-receptor bearing cells (e.g. 

25 macrophages), and might lead to increased transmission of HIV-1 across the placental barrier. 

A complementary recombinant molecule has also been made, wherein the binding portion of 
CD4 is fused to the Fv region of an antibody directed to human CD3; this "Janusin" molecule may be 
able to re-target cytotoxic T-lymphocytes onto HIV-infected cells (Traunecker et al, Embo J. 
10:3655-3659, 1991; Traunecker et al. Int. J. Cancer: Supp. 7:51-52, 1992). Janusin has been 

30 reported to inhibit HIV-mediated cell fusion when administered in vitro with neutralizing antibody to 

either gp41 or the V3 loop of gpl20 (Allaway etal, AIDS Res. Hum. Retroviruses 9:581-587, 1993; 
U.S. Patent No. 5,817,767). This system is inherently complicated and inefficient because multiple 
molecules must be co-administered to the subject. 

This invention is directed to proteins that address key failures of the prior art. 

35 
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SUMMARY OF THE INVENTION 

The present invention takes advantage of the finding that the neutrahzing activities of MAbs 
against certain highly conserved determinants of the coreceptor-binding region of gpl20 are revealed 
only when CD4 first binds to gpl20 (as in an sCD4-activated fusion assay). Although some MAbs to 
5 CD4-induced epitopes (e.g., the human MAbs 17b and 48d, Thali et ah, J. Virol. 67:3978-3988, 

1993) are broadly cross-reactive with Envs from diverse HIV genetic subtypes (Clades), these 
neutralizmg epitopes are only briefly exposed in vivo, and therefore have provided poor targets for 
clinically protective antibody binding. 

The inventors have overcome these difficulties by creating a fusion protein containing a 

10 fragment of CD4 attached via a linker to a human single chain Fv directed against an induced (for 

example, a CD4-induced) neutralizing epitope on gpl20, for instance a coreceptor-binding 
determinant of gpl20. CD4-binding exposes highly conserved gpl20 determinants involved in 

Ti binding to coreceptor; therefore the provided fusion protein will have the properties of a highly 

potent, broadly cross-reactive neutralizing antibody with high in vivo activity and no Fc-mediated 

■il5 undesirable targeting properties. When the fusion protein is substantially derived from human 

proteins, it has minimal immunogenicity and toxicity in humans. Such an agent has great value in 
the prevention of infection during or immediately after HIV exposure (mother/infant transmission, 
post-exposure prophylaxis, topical inhibitor), and also in the treatment of chronic infection. 

Accordingly, a first embodiment of the current invention is a neutralizing bispecific fusion 

l20 protein capable of binding to two sites on a target protein. This protein has two different binding 

domains, an inducing-binding domain and an induced-binding domain, functionally linked by a 
peptide linker. Nucleic acid molecules encoding such fusion proteins are fiinher aspects of this 
invention. Also encompassed in the invention are protein analogs, derivatives, or mimetics of such 
neutralizing bispecific fusion proteins. The arrangement of the inducing- and induced-binding 

25 domains need not be organized in binding sequence; the amino-proximal or carboxy-proximal 

binding domain of the fusion protein may be either the induced-binding or the inducing-binding 
domain. 

In certain embodiments, the linker of this invention is of such length and secondary 
structure that the linker allows the second binding domain to be in binding proximity to the induced 

30 epitope of the target protein when the first bindmg domain is bound to the inducing site of the target 

protein. The linker may for instance be substantially flexible. Linkers of about 25-1 00 angstroms 
(A), or about 15-100 amino acid residues in length, are examples of linkers of a sufficient length to 
maintain the second binding domain in binding proximity to the induced epitope. Specific examples 
of linkers will include one or more occurrences of the amino acid sequence represented by SEQ ID 

35 NO: 1 . For instance, the invention encompasses bispecific fusion proteins wherein the two binding 

domains are functionally linked by the amino acid sequence represented by SEQ ID NO: 2. 
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Targets for bispecific fusion proteins according to this invention include viral envelope 
proteins. For instance, viral envelope proteins from the hxaman immunodeficiency virus (HIV) are 
targets for the disclosed invention. In a specific embodiment of the invention, the viral envelope 
protein target is gpl 20. 

5 In further aspects of the invention, the first binding domain is capable of binding to an 

inducing site on the target protein, thereby exposing an induced epitope. For instance, the first 
binding domain can be a ligand such as CD4 or fragments thereof Alternatively, such a first binding 
domain may be a binding portion of a variable region of an antibody heavy or light chain. The first 
binding domain may include, for example, an antibody-binding domain, a single-chain Fv (SCFv), or 
10 binding fragments thereof. 

The second binding domain, which is capable of forming a neutralizing complex with an 
induced epitope of the target protein, may be for example an antibody or fragments thereof, such as 
;s the variable region, Fv, Fab or antigen-binding domain of an antibody. Another example of the 

S second binding domain of the fusion protein is an engineered single-chain Fv (SCFv). 

■i 5 In some particular examples where HIV gp 120 is the target, and the inducing site is the 

:| gpl20 CD4 binding site, the induced epitope may be a coreceptor-binding determinant of gpl20. 

'I Accordingly, aspects of this invention include proteins in which the first binding domain binds to 

gpl 20 in such a way as to cause a CD4- induced conformational change in the complexed gpl 20 that 
exposes the second binding domain. The first binding domain may be derived from a CD4 molecule, 
\20 and include CD4 and soluble fragments thereof (sCD4, e.g. Dl, D1D2 and other such fragments), 

,1 and proteins that mimic the biological activity of a CD4 molecule in binding to the inducing site of 

" gpl 20. In another embodiment of the invention, the first domain of the gpl20-targeted bispecific 

fusion protein is derived from a CD4 anti-idiotypic antibody, or antibodies that mimic CD4 in 
exposing epitopes. 

25 The second domain of the gpl20-targeted bispecific fusion protein, which binds to an 

epitope induced by binding of the first fusion domain, may be chosen from domains and fragments of 
proteins that bind to such CD4 induced epitopes. Antibodies directed to the induced epitopes, as well 
as the HIV coreceptor (e.g. a chemokine receptor), HIV coreceptor mimics, and fragments of HIV 
coreceptor proteins, are examples of sources for the second binding domain of a gpl20-target 

30 bispecific fusion protein of this invention. Examples of chemokine receptors with HIV coreceptor 

activity include CXCR4, CCR5, CCR2B, and CCR3. Neutralizing antibodies, including 1 7b and 
48d, are examples of antibodies. Fusion proteins wherein the second domain is an engineered single 
chain Fv (SCFv) derived from such a neutralizing antibody are also encompassed. 

A particular embodiment of this invention is a functional recombinant bispecific fusion 

35 protein capable of binding to two sites on gpl 20, wherein the inducing-binding domain is sCD4; the 
induced-bindmg domain is SCFv(17b); and these two domains are linked by a linker of a length 
sufficient to maintain the SCFv(17b) in binding proximity an SCFv(17b) epitope when sCD4 is 
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bound to gpl20. A prototypical bispecific fusion protein has the amino acid sequence shown in SEQ 
ID NO: 3. Nucleic acid molecules encoding such a fusion protein are also encompassed; the 
prototypical nucleic acid molecule has the sequence shown in SEQ ID NO: 4. Vectors and cells 
comprising this nucleic acid molecule are also encompassed in the ciirrent invention, as are 
transgenic plants and animals that express the nucleic acid molecule. 

This invention also provides methods for producing functional recombinant bispecific 
fusion proteins capable of binding two sites on a target protein. Such a protein can be produced m a 
prokaryotic or eukaryotic cell (e.g., yeast, insect and mammalian cells), for instance by transforming 
or transfecting such a cell with a recombinant nucleic acid molecule comprising a sequence which 
encodes a disclosed bispecific fusion protein. Such transformed cells can then be cultured under 
conditions that cause production of the fttsion protein, which is then recovered through protein 
purification means. The protein can include a molecular tag, such as a six-histidine tag, to facilitate 
its recovery. In particular embodiments, the protein has a hexa-histidine (hexa-his) tag, and a 
thrombin cleavage site. 

The invention further provides methods for inactivating a target protein, for instance a 
gpl20 protein, by contacting the target protein with a fusion protein according to this invention. 
Where the target protein is gpl20, this method involves contacting gpl20 with a gpl20-targeted 
bispecific fusion protein, for mstance sCD4-SCFv(17b). Proteins according to the current invention 
can also be used to neutralize a human immunodeficiency virus, by contactmg the human 
immunodeficiency virus with a gpl20-targeted fusion protein according to this invention. Binding of 
a viral or recombinant gpl20 protein to soluble CD4 or lymphocyte CD4 can also be blocked and/or 
prevented by contacting the gpl20 protein with gpl20-targeted fusion protein. In any of these 
methods, a variant protein, analog or mimetic of the fusion protein as provided herein may also be 
used. 

Proteins of the current invention can be used to inhibit virus replication or infectivity in a 
subject by administering to the subject an amount of the fusion protein (for example the sCD4- 
SCFv(17b) fusion protein), or a variant protein, analog or mimetic thereof, sufficient to inhibit HIV 
virus replication or infectivity. The fusion protein can be administered in a pharmaceutical 
composition, and given therapeutically to a person who is known to be infected with HIV, or 
prophylactically to help prevent infection in someone who has been exposed to the virus, or is at high 
risk for exposure. Proteins of this invention can also be administered in combination with another 
compound for the treatment or prevention of HIV infection, such as an HIV reverse transcriptase 
(RT), integrase, or protease inhibitor, another HIV-1 neutralizing antibody, or an Env-targeted toxin. 
The other drug may be an HIV antiviral agent, an HIV anti-infective agent, and/or an 
immunomodulator, or combinations thereof. 



wo 00/55207 



PCT/USOO/06946 



-6- 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a bar graph illustrating relative HIV-1 Env-mediated fusion, in the presence (+) 
or absence (-) of soluble CD4, between effector cells expressing Env (Ba-L) and target cells 
expressing CCR5 (co-receptor), but no CD4 (primary receptor). 

Figure 2 is a graph showing that antibody 17b does not inhibit HIV-1 Etiv-mediated fusion 
in the conventional assay (open box: CXCR4 and CD4 on target cell), but strongly inhibits cell 
fusion in the sCD4-activated assay (filled circle: only CXCR4 on target cell, sCD4 provided). 
Additional experiments indicate that this phenomenon occurs with diverse Envs using either CXCR4 
or CCR5, and that 17b has broad cross-reactive activity with Envs from genetically diverse HIV-1 
isolates. 

Figure 3 is a schematic diagram of the CD4-SCFv(17b) genetic construct. The genetic 
constmct encodes sCD4 (D1D2, plus the native CD4 N-terminal signal sequence), followed by the 
LI linker (Gly^Ser),, which attaches the 17b SCFv (Vh attached to V^ via the L2 linker (Gly4Ser)3), 
followed by the thrombm cleavage site and hexa-his tag. There is a BamH I site in the middle of LI 
to facilitate production of constructs of different lengths. 

Figure 4 is a drawing of mechanisms of binding of a sCD4-SCFv(17b) to gpl20, and the 
resulting neutralization of HIV Env function (fusion and infectivity). 

Figs. 4A, 4B, and 4C depict the proposed interaction of HIV (mediated by gpl20) with the 
cell surface receptor CD4 and co-receptor CCR5, and the beginning of fusion (mediated by gp41). 
Interaction between gpl20 and CD4 (Fig. 4A) causes a change in the conformation of gpl20 (Fig. 
4B), which enables interaction between gpl20 and CCR5 (Fig. 4B). This triggers a conformational 
change in gp41 (Fig. 4C), and leads to fusion. Antibody (for instance, MAb 17b) binds poorly to the 
transiently exposed epitope on gpl20 (Fig. 4B), and thus results in only weak neutralization of fusion 
or infection. 

Figs. 4D and 4E depict a proposed mechanism of sCD4-SCFV(17b) neutralization of fusion. 
In the presence of the bispecific chimeric fusion protein, the sCD4 domain can bmd to gpl20 and 
induce a conformational change in this protein sufficient to permit binding of the SCFV(17b) (Fig. 
4D). This effectively blocks fusion between the HIV and infection and the target cell. 

SEQUENCE LISTING 

The nucleic and amino acid sequences listed in the accompanying sequence listing are 
shown using standard letter abbreviations for nucleotide bases, and three letter code for amino acids. 
Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood 
as included by any reference to the displayed strand. 

SEQ ID NO: 1 shows the basic repeat cassette for a linker polypeptide. 

SEQ ID NO: 2 shows a seven-repeat polypeptide linker 

SEQ ID NO: 3 shows the amino acid sequence of the CD4-SCFv(17b) chimeric protein. 
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SEQ ID NO: 4 shows the nucleic acid sequence of CD4-SCFv(I7b). 

SEQ ID NO: 5 shows the pair of synthetic oligonucleotides used to form the second half of 
the Stu I site near the 3' end of CD4 and to produce an Spe I overhang at the 3' end of an 
intermediate construct (site to be destroyed upon ligation into pCB-3); the oligonucleotide sequences 
5 reconstruct the remainder of the second domain of CD4 (through ser,83), and encode an amino acid 
sequence including alaisjser.gj of CD4 D2 plus an intermediate 37 residue linker (gly4ser)6gly4thr2ser, 
followed directly by the universal translational termination sequence (UTS). 

SEQ ID NO: 6 shows the peptide sequence encoded for by the nucleotide sequences in SEQ 
ID NO: 5. 

1 0 SEQ ID NO: 7 shows the forward (5 ') primer used to amplify and attach the 1 7b SCFv 

sequence to the CD4- linker sequence in pCD2. Italics show the region of the primer that overlaps 
with 17b. 

SEQ ID NO: 8 shows the amino acid sequence encoded by the oligonucleotide primer in 
SEQ ID NO:7. This sequence includes the GlySer residues at the third (Gly^Ser) repeat within LI 
1 5 (encoded by the BamH I site, followed by the remaining four (Gly^Ser) repeats, followed by the first 

ten residues of the 17b SCFv (shown in italics). 

SEQ ID NO: 9 shows the 3' primer used to amplify and attach the 17b SCFv sequence plus 
the thrombin cleavage site and the hexa-his tag to the CD4- linker sequence in pCD2. 

SEQ ID NO: 10 shows the peptide encoded for by the nucleotide sequence in SEQ ID NO: 

20 9. 

DETAILED DESCRIPTION OF THE INVENTION 
I. Abbreviations and Definitions 

25 A. Abbreviations 

HIV: human immunodeficiency vims 

gpl20: the extemal subunit of the envelope glycoprotein complex of HIV 
Env: the envelope glycoprotein complex of HIV 
MAb: monoclonal antibody 
30 Fv: antibody "fragment variable", the variable region of an antibody 
SCFv: single-chain antibody variable region 

B. Definitions 

Unless otherwise noted, technical terms are used according to conventional understanding. 
35 Definitions of common terms in molecular biology may be found in Benjamin Lewin, Genes V, 

Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al. (eds.). The Encyclopedia of 
Molecular Biology, Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); and Robert A. Meyers 
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(ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, VCH Publishers, 
Inc., 1995 (ISBN 1-56081-569-8). 

In order to facilitate review of the various embodiments of the invention, the following 
definitions of terms are provided: 

Animal: Living multi-cellular vertebrate organisms, a category that includes, for example, 
mammals and birds. The term mammal includes both human and non-human mammals. Similarly, 
the terra "subject" includes both human and veterinary subjects. 

Bispecific fusion protein: Proteins that have at least two domains fused together, each 
domain comprising a binding region capable of forming a specific complex with a target protein. In 
general, the two domains are genetically fused together, in that nucleic acid molecules that encode 
each protein domam are functionally linked together, for instance by a linker oligonucleotide, 
thereby producing a single fusion-encoding nucleic acid molecule. The translated product of such a 
fusion-encoding nucleic acid molecule is the bispecific fusion protein. 

The two binding regions of such a bispecific protein may associate with two different 
bindmg determinants or epitopes on a single target molecule. One binding domain may bind first to 
such a target and thereby induce a conformational change in the target such that the binding of the 
second binding domain to the target is enabled, facilitated, or otherwise increased in affinity. In such 
an instance, the domam that binds first to the target can be referred to as the inducing-bmding 
domain, while the domain that binds second is the induced-binding domain. These fiision protein 
domains need not be organized in binding sequence; the amino-proximal binding domain of the 
fiision protein may be either the mduced-binding or the inducing-binding domain; likewise for the 
carboxy-proximal binding domain. 

Bispecific fusion proteins can be further labeled according to the target protein they bind to 
and neutralize. For mstance, a bispecific fusion protein according to the current invention that bmds 
to two specific sites on HIV gpl20 protein may be referred to as a gp 120- targeted bispecific fiision 
protein. 

CD4: Cluster of differentiation factor 4, a T-cell surface protein that mediates interaction 
with the MHC class II molecule. CD4 also serves as the primary receptor site for HIV on T-cells 
during HIV infection. 

Molecules that are derived from CD4 include fragments of CD4, generated either by 
chemical (e.g. enzymatic) digestion or genetic engineering means. Such a fragment may be one or 
more entire CD4 protein domains (for example, extracellular domains Dl, D2, D3, and D4), as 
defined in the immunological literature, or a portion of one or more of these well-defined domains. 
For instance, a binding molecule or binding domain derived from CD4 would comprise a sufficient 
portion of the CD4 protein to mediate specific and functional interaction between the binding 
fragment and a native or viral binding site of CD4. One such binding fragment includes both the Dl 
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and D2 extracellular domains of CD4 {CD4 D1D2), though smaller fragments may also provide 
specific and functional CD4-Iike binding. The gpl20-binding site has been mapped to Dl of CD4. 

The term "CD4-derived molecules" also encompasses analogs (non-protein organic 
molecules), derivatives (chemically functionalized protein molecules obtained starting with the 
disclosed protein sequences) or mimetics (three-dimensionally similar chemicals) of the native CD4 
structure, as well as proteins sequence variants or genetic alleles, that maintain the ability to 
functionally bind to a target molecule. 

CD4-induced conformational change: A change induced in the three-dunensional 
conformation of the interacting gpl20 protein when CD4 specifically interacts with gpl20 to form a 
complex. One characteristic of such a change is the exposure of at least one induced epitope on the 
interacting gpl20 molecule. An epitope induced by such a change is called a CD4-induced epitope. 
Such a CD4-induced epitope may for instance include gpl20 epitopes at or near the co-receptor- 
binding region of the protein. 

In addition to CD4 binding, the binding of other molecules may induce the exposure of 
induced epitopes on gpl20. Such other uiducing molecules are considered CD4-like in terms of their 
epitope-inducing ability, to the extent that they expose epitopes congruent with or equivalent to those 
mduced epitopes exposed upon the binding of native CD4. These other inducing molecules include, 
but in no way are limited to, fragments of CD4, for instance sCD4, or a fragment containmg the Dl 
or Di and D2 domains of native CD4. A mannose-specific lectin (SC) may also serve to expose a 
CD4-induced epitope (see U.S. Patent No. 5,843,454), as can certain anti-gpl20 MAbs. 

Complex (complexed): Two proteins, or fragments or derivatives thereof, are said to form 
a complex when they measurably associate with each other in a specific manner. Such association 
can be measured in any of various ways, both dnect and indirect. Direct methods may include co- 
migration in non-denaturing fractionation conditions, for instance. Indirect measurements of 
association will depend on secondary effects caused by the association of the two proteins or protein 
domains. For instance, the formation of a complex between a protein and an antibody may be 
demonstrated by the antibody-specific inhibition of some function of the target protein. In the case 
of gpl20, the formation of a complex between gpl20 and a neutralizing antibody to this protein can 
be measured by determining die degree to which the antibody inhibits gpl20-dependent cell fusion or 
HIV infectivity. Cell fusion inhibition and infectivity assays are discussed further below. 

Exposing an induced epitope: The process by which two proteins interact specifically to 
form a complex (an inducing complex), thereby causing a conformational change in at least one of 
the two proteins (the target protein) such that at least one previously poorly accessible epitope (an 
induced epitope) is made accessible to intramolecular interaction. The formation of such an mducmg 
complex will generally cause the exposure of more than one induced epitope, each of which may be 
thereby rendered accessible for intramolecular interaction. 
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HIV coreceptor: A cell-surface protein other than CD4 involved in the interaction of HIV 
virus and its subsequent entry into a target cell. These proteins may also be referred to as fusion 
coreceptors for HIV. Examples of such coreceptor proteins include, for instance, members of the 
chemokine receptor family (e.g. CXCR4, CCR5, CCR3, and CCR2B). 

HIV coreceptor proteins interact with coreceptor binding determinants of gpl20. In general, 
it is believed that some of these determinants are exposed on gpl20 only after the specific interaction 
of gpl20 with CD4, and the consequent CD4-induced conformational change in the interacting 
gpl20. Thus certam HIV coreceptor binding determinants are, or overlap with, CD4-induced 
epitopes. 

Neutralization of gpl20 can be achieved by the specific binding of neutralizing proteins or 
protein fragments or domains to one or more coreceptor binding determinants of gpl20, thereby 
blocking interaction between complexed gpl20 and the native coreceptor. 

HIV neutralizing ability: The measurable ability of a molecule to inhibit infectivity of 
HIV virus, either in vivo or in vitro. The art is replete with methods for measuring the neutralizing 
ability of various molecules. Techniques include in vitro peripheral blood mononuclear cell (PBMC) 
based assays (D'Souza et al, 1997); measurement of virion attachment (Mondor et al, J. Virol. 
72:3623-3634, 1998); neutral red dye uptake and antigen capture assays (U.S. Patent No. 5,695,927); 
vaccinia-based reporter gene cell fusion assay (Nussbaum et al, J. Virol. 68:541 1-5422, 1994) 
(standard and sCD4 activated assays); productive infection assays (measuring gag antigen p24 or RT 
synthesis) (Kam, HIV: a practical approach. Oxford Univ. Press, Cambridge, 1995); and infectivity 
titer reduction assays (Kam, 1995). 

In addition, physical interaction between gpl20 and CD4 or other CD4-like molecules can 
be examined by various methods. See, for instance U.S. Patent No. 5,843,454 (measuring 
conformational changes of gpl20 on binding of various proteins by vims release and susceptibiHty of 
gpl20 to thrombin-mediated cleavage of the V3 loop). Alternately, the ability of the CD4-like 
molecule to compete for binding to gpl20 with either native CD4 or antibody that recognizes the 
CD4 binding site on gpl20 (CD4BS) can be measured. This will allow the calculation of relative 
binding affinities through standard techniques. 

The invention also includes analogs, derivatives ormimetics of the disclosed fusion 
proteins, and which have HIV neutralizing ability. Such molecules can be screened for HIV 
neutralizing ability by assaying a protein similar to the disclosed fusion protein, in that it has one or 
more conservative amino acid substitutions, or analogs, denvatives or mimetics thereof, and 
determining whether the similar protein, analog, derivative or mimetic provides HIV neutralization. 
The HIV neutralization ability and gpl20 binding affinity of these derivative compounds can be 
measured by any known means, including those discussed in this application 

Injectable composition: A pharmaceutically acceptable fluid composition comprising at 
least one active ingredient, e.g. a bispecific fusion protein. The active ingredient is usually dissolved 
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or suspended in a physiologically acceptable carrier, and the composition can additionally comprise 
minor amounts of one or more non-toxic auxiliary substances, such as emulsifying agents, 
preservatives, and pH buffering agents and the like. Such injectable compositions that are useful for 
use with the fusion proteins of this invention are conventional; formulations are well known in the 
art. 

Isolated: An "isolated" biological component (such as a nucleic acid molecule, protein or 
organelle) is one that has been substantially separated or purified away from other biological 
components in the ceil of the organism in which the component naturally occurs, i.e., other 
chromosomal and extra-chromosomal DNA and RNA, proteins and organelles. Nucleic acids and 
proteins that have been "isolated" include nucleic acids and proteins purified by standard purification 
methods. The term also embraces nucleic acids and proteins prepared by recombinant expression in 
a host cell as well as chemically synthesized nucleic acids. 

Neutralizing antibodies: An antibody that is able to specifically bind to a target protein in 
such a way as to inhibit the subsequent biological functioning of that target protein is said to be 
neutralizing of that biological function. In general, any protein that can perform this type of specific 
blocking activity is considered a neutralizing protein; antibodies are therefore a specific class of 
neutralizing protein. The complex formed by binding of a neutralizing protein to a target protein is 
called a neutralizing complex. 

Antibodies that bind to viruses and bacteria and thereby prevent the binding of these 
pathogens to target host cells are said to neutralize the pathogen. Therefore, antibodies that bind to 
HIV proteins and measurably reduce the ability of the vims to bind to or enter target cells (e.g., T-cells 
or macrophages) are HIV-neutralizing antibodies. In general, HIV neutralizing antibodies can be 
broken down into several different classes dependent on what region of the viral envelope protein the 
antibody binds to. Broad classes of such antibodies include anti-gp41 and anti-gpl20 antibodies. 
There are several antigenic regions on the gpl20 protein that provide epitopes for the natural or 
laboratory generation of HIV neutralizing antibodies (see WO 98/36087). Broadly cross-reactive 
neutralizing antibodies usually interact with relatively invariant regions of Env. 

A prhnary source of neutralizing antibodies is the peripheral blood of patients infected with 
the HIV virus. Such primary isolates can be cloned and/or immortahzed using standard techniques. In 
addition to the isolation of naturally-occurring neutralizing antibodies, procedures specifically directed 
toward their production are known in the art. See U.S. Patent Nos. 5,843,454; 5,695,927; 5,643,756; 
and 5,013,548 for instance. 

Linker: A peptide, usually between two and 150 ammo acid residues in length that serves to 
join two protein domains in a multi-domain fusion protein. Examples of specific hnkers can be found, 
for uistance, in Hennecke et al. (Protein Eng. 1 1:405-410, 1998); and U.S. Patent Nos. 5,767,260 and 
5,856,456. 
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Depending on the domains being joined, and their eventual function in the fusion protein, 
linkers may be from about two to about 150 amino acids in length, though these limits are given as 
general guidance only. The tendency of fusion proteins to form specific and non-specific multimeric 
aggregations is influenced by linker length (Alfthan ^j/., 199^ Protein Eng. 8:725-731, 1998). Thus, 
shorter linkers will tend to promote multimerization, while longer linkers tend to favor maintenance of 
monomeric fusion proteins. Aggregation can also be minimized through the use of specific linker 
sequences, as demonstrated in U.S. Patent No. 5,856,456. 

Linkers may be repetitive or non-repetitive. One classical repetitive linker used in the 
production of single chain Fvs (SCFvs) is the (Gly4Ser)3 (or (GGGGS), or (G4S)3) linker. More 
recently, non-repetitive linkers have been produced, and methods for the random generation of such 
linkers are known (Hennecke et al. Protein Eng. 1 1 :405-410, 1998). In addition, linkers may be 
chosen to have more or less secondary character (e.g. helical character, U.S. Patent No. 5,637,481) 
depending on the conformation desired in the final ftision protein. The more secondary character a 
linker possesses, the more constrained the structure of the fmal fusion protein will be. Therefore, 
substantially flexible linkers that are substantially lacking in secondary structure allow flexion of the 
fusion protein at the linker. 

A linker is capable of retaining a binding domain of a protein in binding proximity of a 
target binding site when the linker is of sufficient length and flexibility to allow specific interaction 
between the binding domain and the target bmding site. In the case of the bispecific fusion proteins 
of this invention, a linker that maintains binding proxunity permits the sequential binding with the 
target of first the mducmg-binding domain of the fusion protein, then the induced-binding domam. 
A linker that maintains the domains of a bispecific fusion protein in binding proximity to a target can 
be considered an operable or functional linker as relates to such a bispecific fusion protein. 

Oiigonucieotlde: A linear polynucleotide sequence of between six and 30O nucleotide bases 
in length. 

Operably linked: A first nucleic acid sequence is operably linked with a second nucleic 
acid sequence when the first nucleic acid sequence is placed in a functional relationship with the 
second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the 
promoter affects the transcription or expression of the coding sequence. Generally, operably linked 
DNA sequences are contiguous and, where necessary to join two protein-coding regions, in the same 
reading frame. 

ORF (open reading frame): A series of nucleotide triplets (codons) coding for amino 
acids without any internal termination codons. These sequences are usually translatable into a 
peptide. 

Parenteral: Administered outside of the intestine, e g., not via the ahmentary tract. 
Generally, parenteral formulations are those that will be administered through any possible mode 
except ingestion. This term especially refers to injections, whether administered intravenously, 
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intiathecaliy, intramuscularly, intraperitoneally, or subcutaneously, and various surface applications 
including intranasal, intradermal, and topical application, for instance. 

Pharmaceutically acceptable carriers: The pharaiaceutically acceptable earners useful in 
this invention are conventional. Remington 's Pharmaceutical Sciences, by E. W. Martin, Mack 
5 Publishing Co., Easton, PA, 15th Edition (1975), describes compositions and formulations suitable 

for pharmaceutical delivery of the fusion proteins herein disclosed. 

In general, the nature of the carrier will depend on the particular mode of administration 
being employed. For instance, parenteral formulations usually comprise injectable fluids that include 
pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced 
10 salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e.g., 

powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, 
pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to 
biologically-neutral carriers, pharmaceutical compositions to be administered can contain minor 
amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and 
15 pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate. 

Purified: The term purified does not require absolute purity; rather, it is intended as a 
relative term. Thus, for example, a purified fusion protein preparation is one in which the fusion 
protein is more enriched than the protein is in its generative environment, for instance within a cell or 
in a biochemical reaction chamber. In some embodiments, a preparation of fusion protein is purified 
,20 such that the fusion protein represents at least 50% of the total protein content of the preparation. 

Recombinant: A recombinant nucleic acid molecule is one that has a sequence that is not 
naturally occurring or has a sequence that is made by an artificial combination of two otherwise 
separated segments of sequence. This artificial combination can be accomplished by chemical 
synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, 
25 e.g., by genetic engineering techniques. 

Similarly, a recombinant protein is one encoded for by a recombinant nucleic acid molecule. 
Sequence identity: The similarity between two nucleic acid sequences, or two amino acid 
sequences is expressed in terms of the similarity between the sequences, otherwise referred to as 
sequence identity. Sequence identity is frequently measured in terms of percentage identity (or 
30 similarity or homology); the higher the percentage, the more similar the two sequences are. Homologs 

of the bispecific fusion protein will possess a relatively high degree of sequence identity when aligned 
using standard methods. 

Methods of alignment of sequences for comparison are well known in the art. Various 
programs and alignment algorithms are described in: Smith and Waterman {Adv. Appl Math. 2: 482, 
35 1981); Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970); Pearson and Lipman (Proc. Natl. 

Acad. Set, USA 85:2444-2448, 1988); Higgins and Sharp (Gene, 73:237-244, 1988); Higgins and 
Sharp (Oi^/6>5 5: 15 1-153, 1989); Corpet et al. (Nuc. Acids Res. 16: 10881-10890, 1988); Huang e/ a/. 
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{Comp. Appls. Biosci. 8:155-165, 1992); and Pearson et al. {Methods in Molecular Biology' 24: 307- 
331, 1994). Altschul etal. {Nature Genet., 6:119-129, 1994) presents a detailed consideration of 
sequence alignment methods and homology calculations. 

The alignment tools ALIGN (Myers and Miller, CABIOS 4:1 1-17, 1989) or LFASTA {Pearson 
5 and Lipman, Proc. Natl. Acad. Sci., USA 85:2444-2448, 1988) may be used to perform sequence 

comparisons (Internet Program © 1996, W. R. Pearson and the University of Virginia, "fasta20u63" 
version 2.0u63, release date December 1996). ALIGN compares entire sequences against one another, 
while LFASTA compares regions of local similarity. These alignment tools and their respective 
tutorials are available on the Internet at http: //biology .nc sa .uiuc . edu . 

10 Orthologs of the disclosed bispecific fusion proteins are typically characterized by possession 

of greater than 75% sequence Identity counted over the full-length alignment with the amino acid 
sequence of bispecific fusion protein using ALIGN set to default parameters. 

The NCBI Basic Local Ahgnment Search Tool (BLAST) (Altschul et al., J Mol Biol. 1 990 
215:403-410, 1990) is available from several sources, including the National Center for Biotechnology 

15 Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with the sequence 

analysis programs blastp, biastn, blastx, tblastn and tblastx. It can be accessed at 
http//www.ncbi.nlm.mh.gov/BLAST/. A description of how to determine sequence identity using this 
program is available at http://www.ncbi.nlm.nih.gov/BLAST/blast help.html . 

For comparisons of amino acid sequences of greater than about 30 amino acids, the "Blast 2 

20 sequences" function is employed using the default BLOSUM62 matrix set to default parameters, (gap 
existence cost of 1 1, and a per residue gap cost of 1). When aligning short peptides (fewer than around 
30 amino acids), the alignment should be performed using the Blast 2 sequences function, employing 
the PAM30 matrix set to default parameters (open gap 9, extension gap 1 penalties). Protems with 
even greater similarity to the reference sequences will show mcreasing percentage identities when 

25 assessed by this method, such as at least 90%, at least 92%, at least 94%, at least 95%, at least 97%,, at 
least 98%, or at least 99% sequence identity. In addition, sequence identity can be compared over the 
foil length of one or both binding domains of the disclosed fusion proteins. In such an instance, 
percentage identities will be essentially similar to those discussed for full-length sequence identity. 
When significantly less than the entire sequence is being compared for sequence identity, 

30 homologs will typically possess at least 80% sequence identity over short windows of 10-20 amino 
acids, and may possess sequence identities of at least 85%o, at least 90%, at least 95%, or at least 99% 
depending on their similarity to the reference sequence. Sequence identity over such short windows 
can be determined using LFASTA; methods are described at http://biology.ncsa.uiuc.edu . One of skill 
in the art will appreciate that these sequence identity ranges are provided for guidance only; it is 

35 entirely possible that strongly significant homologs could be obtained that fall outside of the ranges 

provided. The present invention provides not only the peptide homologs that are described above, but 
also nucleic acid molecules that encode such horaoiogs. 
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An alternative indication that two nucleic acid molecules are closely related is that the two 
molecules hybridize to each other under stringent conditions. Stringent conditions are sequence- 
dependent and are different under different environmental parameters. Generally, stringent conditions 
are selected to be about 5° C to 20° C lower than the thermal melting point (T^,) for the specific 
5 sequence at a defined ionic strength and pH. The T„ is the temperature (under defined ionic strength 

and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Conditions for 
nucleic acid hybridization and calculation of stringencies can be found in Sambrook et al. (In 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989) and Tijssen 
{Laboratoiy Techniques in Biochemistry and Molecular Biology Part I, Ch. 2, Elsevier, New York, 

10 1 993). Nucleic acid molecules that hybridize under stringent conditions to the disclosed bispecific 

fusion protein sequences will typically hybridize to a probe based on either the entire fusion protein 
encoding sequence, an entire binding domain, or other selected portions of the encoding sequence 
under wash conditions of 0.2 x SSC, 0.1% SDS at 65° C. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 

J 5 similar amino acid sequences, due to the degeneracy of the genetic code. It is understood that changes 

in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences, 
each encoding substantially the same protein. 

Specific binding agent: An agent that binds substantially only to a defined target. Thus a 
gpl20-specific binding agent binds substantially only the gpl20 protein. As used herein, the term 

20 "gpl20-specific binding agenf includes anti-gpl20 antibodies and other agents that bind substantially 

only to a gpl20 protein. 

Anti-gp 1 20 antibodies may be produced using standard procedures described in a number of 
texts, including Harlow and Lane {Using Antibodies, A Laboratory Manual, CSHL, New York, 1999, 
ISBN 0-87969-544-7). In addition, certain techniques may enhance the production of neutralizing 

25 antibodies (U.S. Patents No. 5,843,454; 5,695,927; 5,643,756; and 5,013,548). The determination that 
a particular agent binds substantially only to gpl20 protein may readily be made by using or adapting 
routine procedures. One suitable in vitro assay makes use of the Western blotting procedure (described 
in many standard texts, including Harlow and Lane, 1999). Western blotting may be used to determine 
that a given protein binding agent, such as an anti-gpl20 monoclonal antibody, binds substantially only 

30 to the MSG protein. Antibodies to gpl20 are well known in the art. 

Shorter fragments of antibodies can also serve as specific binding agents. For instance, FAbs, 
Fvs, and single-chain Fvs (SCFvs) that bind to gpl20 would be gpl20-specific binding agents. 

Therapeutically effective amount of a bispecific fusion protein: A quantity of bispecific 
fusion protein sufficient to achieve a desired effect in a subject being treated. For instance, this can 

35 be the amount necessary to inhibit viral proliferation or to measurably neutralize disease organism 

binding mechanisms. In general, this amount will be sufficient to measurably inhibit vims {e.g. HTV) 
replication or mfectivity. 
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An effective amoxint of bispecific fusion protein may be administered in a single dose, or in 
several doses, for example daily, during a course of treatment. However, the effective amount of 
fusion protein m\\ be dependent on the fusion protein applied, the subject being treated, the severity 
and type of the affliction, and the maimer of administration of the fusion protein. For example, a 
5 therapeutically effective amount of fusion protein can vary from about 0.01 mg/kg body weight to 

about 1 g/kg body weight. 

The fusion proteins disclosed in the present invention have equal application in medical and 
veterinary settings. Therefore, the general term "subject being treated" is understood to include all 
animals (e.g. humans, apes, dogs, cats, horses, and cows) that are or may be infected with a virus or 
1 0 other disease-causing microorganism that is susceptible to bispecific fiision protein-mediated 

neutralization. 

Transformed: A transformed cell is a cell mto which has been introduced a nucleic acid 
molecule by molecular biology techniques. As used herein, the term transformation encompasses all 
techniques by which a nucleic acid molecule might be introduced into such a cell, including 
15 transfection with viral vectors, transformation with plasmid vectors, and introduction of naked DNA 

by electroporation, lipofection, and particle gun acceleration. 

Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a 
transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in a 
host cell, such as an origin of replication. A vector may also include one or more selectable marker 
20 genes and other genetic elements known in the art. 

II. Construction, Expression, and Purification of Bispecific Fusion Proteins. 
A. Selection of component domains. 

This invention provides generally a bispecific fusion protein that binds to two different sites 
25 on a target protein. As such, any target protein that has two different binding sites is an example of a 

target for a bispecific fusion protein. Particular targets include proteins on which one of the two 
binding sites (the induced-bindmg site) is exposed/induced by the binding of the fusion protein to a 
first binding site (the inducing-binding site) on the target. The choice of protein binding domains for 
incorporation into the disclosed bispecific fusion protein will be dictated by the target protein chosen. 
30 The choice of linker will also be influenced by the target protein and binding sites chosen. In 

general, the linker used in any bispecific fusion will be of a length and secondary character to hold 
the induced-binding domain within binding proximity of the target protein induced binding site, once 
the inducing-binding domain of the fusion protein has formed a specific complex with the target. 

In certain embodiments, the target protem is an HIV envelope glycoprotein, for instance 
35 HIV- 1 gpl20. In certam of these and other embodiments, the inducing-binding site is the CD4 

binding site on gpl20. As such, the inducing-binding domain of the disclosed bispecific fusion 
protein can be a binding fragment of CD4, for mstance sCD4. Alternately, any other molecule that 
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specifically interacts with gpl20 in such a way as to expose one or more induced epitopes would also 
serve as the source of an inducing-binding protein domain. The specific fragments used to construct 
the fusion protein should be chosen so that the conformation of the final fasion provides functional 
and inducing binding to gpl20; this can be assayed either directly (e.g., affmity measurements) or 
5 indirectly (e.g., neutralization assays). 

Non-CD4-derived CD4 mimics may also be employed as sources for inducing-binding 
domains of the disclosed fusion proteins. For instance, a maimose-specific lectin (SC) may serve to 
induce CD4 induced conformational changes (see U.S. Patent No. 5,843,454). Alternatively, 
antibodies that bind the CD4-binding site or another epitope of gpl20 and thereby induce a CD4-like 
10 conformational change on the complexed protein can also be used. 

Non-peptide CD4 analogs can also be used in this invention, for instance organic or non- 
organic structural analog of the gpl20-interacting domain(s) of the CD4 molecule. 
1 Induced-binding domains of a gpl20-targeted fusion protein will include antibodies (or 

fragments thereof) that recognize induced epitopes of the complexed gpl20. In some embodiments, 
15 such antibodies are broadly cross-reactive against diverse HIV-1 isolates. Induced epitopes include 

all of those referred to as CD4-induced (CD4i) epitopes, and in particular those which overlap 
coreceptor-binding determinants of gpl20. Previously identified neutralizing monoclonal antibodies 
can be used, and include but are not limited to human monoclonal antibodies 1 7b, 48d, and CG 1 0. 

Likewise, induced binding domains of the disclosed chimeric molecules can be non-peptide 
20 molecules, for instance organic or non-organic structural analogs of SCFv(l 7b). 

In addition to antibodies that bind induced epitopes of gpl20, other sources for induced- 
? binding domains include fragments of coreceptors that specifically interact with a coreceptor binding 

domain(s) of gpl20. 

The construction of a gpl20-specific bispecific fusion protein can be aided by review of the 
25 X-ray crystaliographic structure of the ternary complex containing the gpl20 core, a two-domain 

fragment of CD4 (D1D2), and an FAb from a broadly cross-reactive human MAb (17b) directed 
against the coreceptor-binding determinants of gpl20 (Kwong et a!.. Nature 393:648-659, 1998). 
Computer-based examination of the structural coordinates of this ternary complex, using FRODO 
(Jones et al., Meth. Enzymol. 115:157-171, 1985; lones, J. Appl. Cryst. 11:268-272, 1978; Pflugrath 
30 et al. Methods and Applications in Crystallography, pages 407-420, Clarendon Press, Oxford), has 

revealed choices for constructing the chimeric protein. The shortest distance between free termini of 
CD4 and the 17b FAb is 56 A, i.e. from the free C-terminus of the D1D2 sCD4 fragment to the N- 
termmus of the 17b FAb heavy chain. A linker connecting these termini would be essentially free of 
steric hindrance from CD4 and the N-terminus of the 1 7b light chain. Possible connections could 
35 also be made between the N-terminus of CD4 and the C-termini of the 17b heavy or light chains; 

such connections would require linkers of about 65 A and about 86 A, respectively. In the latter two 
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connections the linker is required to circumvent other portions of the complex, including the bulky 
variable loops. 

B. Assembly. 

The construction of chimeric molecules, in particular fusion proteins, from domains of 
known proteins is well known. In general, a nucleic acid molecule that encodes the desired protein 
domains are joined using standard genetic engineering techniques to create a single, operably linked 
fusion oligonucleotide. Molecular biological techniques may be found in Sambrook et al. (In 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989). Specific 
examples of genetically engineered multi-domain proteins, especially those based on molecules of 
the immunoglobulin superfamily, joined by various linkers, can be found in the following patent 
documents: 

U.S. Patent No. 5,856,456 ("Linker for linked fusion polypeptides"); 

U.S. Patent No. 5,696,237 ("Recombinant antibody-toxin fusion protein"); 

U.S. Patent No. 5,767,260 ("Antigen-binding fusion protems"); 

U.S. Patent No. 5,587,455 ("Cytotoxic agent against specific virus infection"); and 

WO 98/36087 ("Immunological tolerance to HfV epitopes"). 

Non-peptide analogs that serve as inducing-binding or induced binding domains of the 
invention can be linked to the opposite domain of the chimeric molecules using known chemical 
linking techniques, including chemical cross-linking. Cross-linkers are well known, and examples of 
molecules used for cross-linkmg can be found, for instance, in U.S. Patent No. 6,027,890 ("Methods 
and compositions for enhancing sensitivity in the analysis of biological-based assays"). 

C. Expression. 

One skilled in the art will understand that there are myriad ways to express a recombinant 
protein such that it can subsequently be purified. In general, an expression vector carrying the 
nucleic acid sequence that encodes the desired protein will be transformed into a microorganism for 
expression. Such microorganisms can be prokaryotic (bacteria) or eukaryotic {e.g., yeast). One 
example species of bacteria that can be used is Escherichia coli (E. coli), which has been used 
extensively as a laboratory experimental expression system. An eukaryotic expression system can be 
used where the protein of interest requires eukaryote-specific post-translational modifications such as 
glycosylation. Also, protein can be expressed using a viral {e.g., vaccinia) based expression system. 

Protein can also be expressed in animal cell tissue culture, and such a system can be used 
where animal-specific protein modifications are desirable or required in the recombinant protein. 

The expression vector can include a sequence encoding a targeting peptide, positioned in 
such a way as to be fused to the coding sequence of the bispecific fusion protein. This allows the 
bispecific fusion protein to be targeted to specific sub-cellular or extra-cellular locations. Various 
prokaryotic and eukaryotic targeting peptides, and nucleic acid molecules encoding such, are known. 
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In a prokaryotic expression system, a signal sequence can be used to secrete the newly synthesized 
protein. In an eukaryotic expression system, the targeting peptide would specify targeting of the 
hybrid protein to one or more specific sub-cellular compartments, or to be secreted from the cell, 
depending on which peptide is chosen. Through the use of an eukaryotic secretion signal sequence, 
the bispecific fusion protein can be expressed in a transgenic animal (for instance a cow, pig, or 
sheep) in such a manner that the protein is secreted into the milk of the animal. 

Vectors suitable for stable transformation of culturable cells are also well known. Typically, 
such vectors include a multiple-cloning site suitable for inserting a cloned nucleic acid molecule, 
such that it will be under the transcriptional control of 5' and 3' regulatory sequences. In addition, 
transformation vectors include one or more selectable markers; for bacterial transformation this is 
often an antibiotic resistance gene. Such transformation vectors typically also contain a promoter 
regulatory region (e.g., a regulatory region controlling inducible or constitutive expression), a 
transcription uiitiation start site, a ribosome binding site, an RNA processing signal, and a 
transcription termination site, each functionally arranged in relation to the multiple-cloning site. For 
production of large amounts of recombinant proteins, an inducible promoter can be used. This 
permits selective production of the recombinant protein, and allows both higher levels of production 
than constitutive promoters, and enables the production of recombinant proteins that may be toxic to 
the expressing cell if expressed constitutively. 

In addition to these general guidelines, protein expression/purification kits are produced 
commercially. See, for instance, the QIAexpress™ expression system from QIAGEN (Chatsworth, 
CA) and various expression systems provided by INVITROGEN (Carlsbad, CA). Depending on die 
details provided by the manufactures, such kits can be used for production and purification of the 
disclosed bispecific fusion proteins. 

D. Purification. 

One skilled in the art will understand that there are myriad ways to purify recombinant 
polypeptides, and such typical methods of protein purification may be used to purify the disclosed 
bispecific fusion proteins. Such methods include, for instance, protein chromatographic methods 
including ion exchange, gel filtration, HPLC, monoclonal antibody affinity chromatography and 
isolation of insoluble protein inclusion bodies after over production. In addition, purification 
affinity-tags, for instance a six-histidine sequence, may be recombinantly fused to the protein and 
used to facilitate polypeptide purification. A specific proteolytic site, for instance a thrombin- 
specific digestion site, can be engineered into the protein between the tag and the fusion itself to 
facilitate removal of the tag after purification. 

Commercially produced protein expression/purification kits provide tailored protocols for 
the purification of protems made using each system. See, for instance, the QIAexpress™ expression 
system from QIAGEN (Chatsworth, CA) and various expression systems provided by 
INVITROGEN (Carlsbad, CA). Where a commercial kit is employed to produce a bispecific fusion 
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protein, the manufacturer's purification protocol is a particularly disclosed protocol for purification 
of that protein. For instance, proteins expressed with an ammo-terminal hexa-his tag can be purified 
by binding to nickei-nitrilotriacetic acid (Ni-NTA) metal affinity chromatography matrix {The 
QIAexpressionist, QIAGEN, 1997). 
5 Alternately, the binding specificities of either the first or second binding domains, or both, 

of the disclosed fusion protein may be exploited to facilitate specific purification of the proteins. 
One method of perfoiming such specific purification would be column chromatography using 
column resin to which the target molecule, or an epitope or fragment or domain of the target 
molecule, has been attached. 
10 If the bispecific fusion protein is produced in a secreted form, e.g. secreted into the milk of a 

transgenic animal, purification will be from the secreted fluid. Alternately, purification may be 
unnecessary if the fusion protein can be applied directly to the subject in the secreted fluid {e.g. 
milk). 

15 III. Variation of a Bispecific Fusion Protein 

A. Sequence Variants 

The binding characteristics and therefore neutralizing activity of the fusion proteins 
disclosed herein lies not in the precise amino acid sequence, but rather in the three-dimensional 
structure inherent in the amino acid sequences encoded by the DNA sequences. It is possible to 

20 recreate the binding characteristics of any of these proteins or protein domains of this invention by 

recreating the three-dimensional stmcture, without necessarily recreating the exact amino acid 
sequence. This can be achieved by designing a nucleic acid sequence that encodes for the three- 
dimensional structure, but which differs, for instance by reason of the redundancy of the genetic 
code. Similarly, the DNA sequence may also be varied, while still producing a fimctional 

25 neutralizing protein. 

Variant neutralizing bispecific binding proteins include proteins that differ in amino acid 
sequence from the disclosed sequence, but that share structurally significant sequence homology with 
any of the provided proteins. Variation can occur in any single domain of the fusion protein (e.g. the 
first or second binding domain, or the linker). Variation can also occur in more than one of such 

30 domains in any particular variant protein. Such variants may be produced by manipulating the 
nucleotide sequence of the, for instance a CD4-SCFv(17b)-encoding sequence, using standard 
procedures, including site-directed mutagenesis or PCR. The simplest modifications involve the 
substitution of one or more amino acids for amino acids having similar biochemical properties. 
These so-called conservative substitutions are likely to have minimal impact on the activity of the 

35 resultant protem, especially when made outside of the binding site of each domain. Table 1 shows 

amino acids that may be substituted for an original amino acid in a protein, and which are regarded as 
conservative substitutions. 
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Original Residue 
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More substantial changes in protein structure may be obtained by selecting amino acid 
substitutions that are less conservative than those listed in Table 1 . Such changes include changing 
residues that differ more significantly in their effect on maintaining polypeptide backbone structure 
(e.g., sheet or helical conformation) near the substitution, charge or hydrophobicity of the molecule 
at the target site, or bulk of a specific side chain. The following substitutions are generally expected 
to produce the greatest changes in protein properties; (a) a hydrophilic residue (e.g., seryl or 
threonyl) is substituted for (or by) a hydrophobic residue (e.g., leucyl, isoleucyl, phenylalanyl, vaiyl 
or alanyl); (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
electropositive side chain (e.g., lysyl, arginyl, or histadyl) is substituted for (or by) an electronegative 
residue (e.g., glutamyl or aspartyl); or (d) a residue having a bulky side chain (e.g., phenylalanine) is 
substituted for (or by) one lacking a side chain (e.g. , glycine). 

Variant binding domain or fusion protein-encoding sequences may be produced by standard 
DNA mutagenesis techniques, for example, M13 primer mutagenesis. Details of these techniques are 
provided in Sambrook (In Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New 
York, 1989), Ch. 15. By the use of such techniques, variants may be created which differ in mmor 
ways from the bispecific tusion protein-encoding sequences disclosed. DNA molecules and 
nucleotide sequences which are derivatives of those specifically disclosed herein and that differ from 
those disclosed by the deletion, addition, or substitution of nucleotides while still encoding a protein 
that binds twice to gpl20, thereby neutralizing HIV vims mfectivity, are comprehended by this 
invention. In their most simple form, such variants may differ from the disclosed sequences by 
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alteration of the coding region to fit the codon usage bias of the particular organism into which the 
molecule is to be introduced. 

Alternatively, the coding region may be altered by taking advantage of the degeneracy of 
the genetic code to alter the codmg sequence such that, while the nucleotide sequence is substantially 
altered, it nevertheless encodes a protein having an amino acid sequence substantially similar to the 
disclosed fusion sequences. For example, the 18th amino acid residue of the CD4-SCFv( 1 7b) protein 
(after cleavage of the N-terminal signal sequence) is alanine. The nucleotide codon triplet GCT 
encodes this alanine residue. Because of the degeneracy of the genetic code, three other nucleotide 
codon triplets - (GCG, GCC and GCA) - also code for alanine. Thus, the nucleotide sequence of the 
disclosed CD4-SCFv(17b)encoding sequence could be changed at this position to any of these three 
alternative codons without affecting the amino acid composition or characteristics of the encoded 
protein. Based upon the degeneracy of the genetic code, variant DNA molecules may be derived 
from the cDNA and gene sequences disclosed herein using standard DNA mutagenesis techniques as 
described above, or by synthesis of DNA sequences. Thus, this invention also encompasses nucleic 
acid sequences which encode a neutralizing bispecific fusion protein, but which vary from the 
disclosed nucleic acid sequences by virtue of the degeneracy of the genetic code. 

B. Peptide Modifications 

The present invention includes biologically active molecules that mimic the action of the 
bispecific fusion proteins of the present invention, and specifically neutralize HIV Env. The proteins 
of the invention include synthetic embodiments of naturally-occurring proteins described herein, as 
well as analogues (non-peptide organic molecules), derivatives (chemically functionalized protein 
molecules obtained starting with the disclosed peptide sequences) and variants (homologs) of these 
proteins that specifically bind with and neutralize HIV gpl20. Each protein of the invention is 
comprised of a sequence of amino acids, which may be either L- and/or D- amino acids, naturally 
occuiring and otherwise. 

Proteins may be modified by a variety of chemical techniques to produce derivatives having 
essentially the same activity as the unmodified proteins, and optionally having other desirable 
properties. For example, carboxylic acid groups of the protein, whether carboxyl-terminal or side 
chain, may be provided in the form of a salt of a pharmaceutically-acceptable cation or esterified to 
form a C,-Ci6 ester, or converted to an amide of formula NR.R, wherein R, and R, are each 
independently H or C,-C,(, alkyl, or combined to form a heterocyclic ring, such as a 5- or 6- 
membered ring. Amino groups of the protein, whether amino-terminal or side chain, may be in the 
form of a pharmaceufically-acceptable acid addition salt, such as the HCl, HBr, acetic, benzoic, 
toluene sulfonic, maleic, tartaric and other organic salts, or may be modified to C,-C,g alkyl or 
dialkyi amino or further converted to an amide. 

Hydroxyl groups of the protein side chains may be converted to C,-C,6 alkoxy or to a C.-C,^ 
ester using well-recognized techniques. Phenyl and phenolic rings of the protein side chains may be 
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substituted with one or more halogen atoms, such as fluorine, chlorine, bromme or iodine, or with C,- 
C,6 alkyl, C,-C,6 alkoxy, carboxylic acids and esters thereof, or amides of such carboxylic acids. 
Methylene groups of the protein side chains can be extended to homologous Cj-C^ alkylenes. Thiols 
can be protected with any one of a number of well-recognized protecting groups, such as acetamide 
groups. Those skilled in the art will also recognize methods for introducing cyclic structures into the 
protems of this invention to select and provide conformational constraints to the structure that result 
in enhanced stability. 

It also may be advantageous to introduce one or more disulfide bonds to connect the 
frameworks of the heavy and light chains in the SCFv domain. This modification often enhances the 
stability and affinity of SCFvs (Reiter et al. Protein Engineering 7:697-704, 1994). Here too, the X- 
ray crystal structure containing the 17 FAb (Kwong et al. Nature 393:648-659, 1998) can be used to 
assess optmial sites for engineering cysteine residues of the heavy and light chains. 

Peptidomimetic and organomimetic embodiments are also within the scope of the present 
invention, whereby the three-dimensional arrangement of the chemical constituents of such peptido- 
and organomimetics mimic the three-dimensional arrangement of the protein backbone and 
component amino acid side chains in the bispecific neutralizing fusion protein, resultmg in such 
peptide- and organomimetics of die proteins of this invention having measurable or enhanced 
neutralizing ability. For computer modeling applications, a pharmacophore is an idealized, three- 
dimensional definition of the structural requirements for biological activity. Peptide- and 
organomimetics can be designed to fit each pharmacophore with current computer modeling software 
(using computer assisted drug design or CADD). See Walters, "Computer- Assisted Modeling of 
Dmgs", in Kiegerman & Groves, eds., 1993, Pharmaceutical Biotechnology. Interpharm Press: 
Buffalo Grove, IL, pp. 165-174 and Principles of Pharmacology Munson (ed.) 1995, Ch. 102, for 
descriptions of techniques used in CADD. Also included within the scope of the invention are 
mimetics prepared using such techniques that produce neutralizing fusion proteins. 
C. Domain length variation. 

It will be appreciated that the protein domains of the current invention may be combined to 
produce fusion protein molecules without necessarily splicing the components in the same place. It 
is beheved to be possible to use shorter or longer fragments of each component domain, linked by a 
functional linker. For instance, any component which is spliced within about 10 amino acid residues 
of the residue specified, and which still provides a functional binding fragment, comprises about the 
same domain. However, domains of substantially longer or substantially shorter length can be used. 
For instance, in certain embodiments, the protein can include a leader sequence plus a four-domain 
CD4 (D1-D4, amino acid residues 1-372), or just the first domain of CD4 (Dl residues 1-113). 
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IV. Activity of Fusion Proteins 

It is important to assess the chemical, physical and biological activity of the disclosed 
bispecific fusion proteins. Among other uses, such assays permit optimization of the domains 
chosen, as well as optimization of the length and conformation of the linkers used to connect them. 
5 Control molecules should be included in each assay; usually such will include each domain alone, as 

well as the two domains as separate molecules mixed in the reaction, for instance in a 1 : 1 molar ratio. 
In the case of a CD4-SCFv(17b) bispecific fusion protein, such controls would include sCD4 and 
SCFv(17b), for instance. 

A. Fusion protein affinity for target protein 

1 0 Fusion protein affinity for the target protein can be determined using various techniques. 

For instance, co-immunoprecipitation analyses with metabolically labeled proteins can be employed 
to determine binding of sCD4-SCFv proteins, e.g. sCD4-SCFv(17b) to soluble HIV-1 gpl20, using 
anti-gpl20 MAbs that do not interfere with CD4 binding (e.g. MAb D47 that binds to V3), or 
polyclonal antibody to the C-terminus of gpl20. ELISA can also be used to examine the binding 

1 5 characteristics of each domain of the chimera. 

B. Neutralization assays 

Various assays can be used to measure the ability of the disclosed fusion proteins to inhibit 
function of the target protein. Individual components of the fusion protein will serve as controls. In 
general, assays will be specific for the target/fusion protein. For instance, many functional analyses 

20 can test the ability of sCD4-SCFv fusions to neutralize the HIV Env. It is particularly advantageous 

to use Envs from diverse HIV-1 strains to test the breadth of inhibition (neutralizing ability) of each 
fusion protein for different HIV-1 genetic subtypes and different phenotypes {i.e. coreceptor usage). 
In addition, it is advantageous to test such fusion proteins in the standard and sCD4-activated assays 
for Env-mediated ceil fusion. Knovra HIV-1 neutralizing MAbs and MAbs against CD4-induced 

25 epitopes on gpl20 are examples of controls for such experiments. Possible synergistic inhibition 

with other known broadly cross-reactive neutralizing MAbs should be tested (e.g. bl2, 2F5, F105, 
2G12). 

In the case of gpl20-targeted fusion proteins, the vaccinia-based reporter gene cell fusion 
assay may be used to assess fusion inhibition (Nussbaum et al, J. Virol. 68:541 1-5422, 1994). One 

30 population of tissue culture cells {e.g. BS-C-1, HeLa, or NIH 3T3) uniformly expressmg vaccinia 

virus-encoded binding and fusion-mediating viral envelope glycoprotein(s) is mixed with another 
population expressing the corresponding cellular receptor(s). In the case of sCD4-SCFv fusions, 
where the target protein is HIV- 1 gpl20, one cell population expresses HIV-1 Env, while the other 
expresses necessary HIV-1 receptors {e.g. CD4 and a chemokine receptor). The cytoplasm of either 

35 cell population also contains vaccinia virus-encoded bacteriophage T7 DNA polymerase; the 

cytoplasm of the other contams a transfected plasmid with the E. coH lacZ gene linked to the T7 
promoter. Upon mixing of the two populanons, cell fusion results m activation of the lacZ gene, 
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thiough the introduction of the T7 RNA polymerase into proximity with the transfected T7 promoter- 
lacZ in the cytoplasm of the fused cells. The resultant B-galactosidase (6-gal) activity is proportional 
to the amount of fusion that occurs, and can be measured by colorimetric assay of detergent cell 
lysates or in situ staining. Cell-fusion neutralizing activity of bispecific fusion proteins is therefore 
5 assessed by measuring their inhibition of B-gal production. 

The gpl20-targeted fusions {e.g. sCD4-SCFv) can also be tested for ability to block HIV-1 
infection using single round assays {e.g. using indicator cell lines, Vodicka et ai. Virology 233:193- 
198, 1997). Target cells expressing CD4 and a specific coreceptor, and containing the lacZ reporter 
gene linked to the HIV-1 long terminal repeat (LTR), are infected with specific HlV-1 strains 
10 (Vodicka, 1997). Integration of an HIV provirus in these cells leads to production of the viral 

transactivator. Tat, which then tums on expression of the 6-gal gene via interaction with LTR. The 
activity of sCD4-SCFv is assessed by its inhibition of production of 6-gal-positive cells (stained blue 
with X-gal), which is proportional to its ability to block HIV-1 infection. 

15 V. Incorporation of Bispecific Fusion Proteins into Pharmaceutical Compositions 

hi Pharmaceutical compositions that comprise at least one bispecific fusion protein as 

ij described herein as an active ingredient will normally be formulated with a solid or liquid carrier, 

^ depending upon the particular mode of administration chosen. The pharmaceutically acceptable 

n carriers and excipients use&l in this invention are conventional. For instance, parenteral 

•to formulations usually comprise injectable fluids that are pharmaceutically and physiologically 

i ; I acceptable fluid vehicles such as water, physiological saline, other balanced salt solutions, aqueous 

IJ dextrose, glycerol or the like. Excipients that can be included are, for instance, other proteins, such 

as human seram albumin or plasma preparations. If desired, the pharmaceutical composition to be 
administered may also contain minor amounts of non-toxic auxiliary substances, such as wetting or 
25 emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate 
or sorbitan monolaurate. 

Other medicinal and pharmaceutical agents, for instance nucleoside derivatives (e.g. AZT) 
or protease inhibitors, also may be included. It may also be advantageous to include other fusion 
inhibitors, for instance one or more neutralizing antibodies. 
30 The dosage form of the pharmaceutical composition will be determined by the mode of 

administration chosen. For instance, in addition to injectable fluids, topical and oral formulations can 
be employed. Topical preparations can include eye drops, ointments, sprays and the like. Oral 
formulations may be liquid (e.g., syrups, solutions or suspensions), or solid {e.g., powders, pills, 
tablets, or capsules). For solid compositions, conventional non-toxic solid carriers can include 
35 pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. Actual methods of 

prepanng such dosage forms are known, or will be apparent, to those skilled in the art. 
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The pharmaceutical compositions that comprise bispecific fusion protein may be formulated 
in unit dosage form, suitable for individual administration of precise dosages. One possible unit 
dosage contains approximately 100 \ig of protein. The amount of active compound administered will 
be dependent on the subject being treated, the severity of the affliction, and the manner of 
5 administration, and is best left to the judgment of the prescribing clinician. Within these bounds, the 
formulation to be administered will contain a quantity of the active component{s) in an amount 
effective to achieve the desired effect in the subject being treated. 

VI. Clinical Use of Bispecific Fusion Proteins 

1 0 The potent viral-neutralizing activity exhibited by the disclosed bispecific fusion proteins 

makes them useful for treating viral infections in human and other animal subjects. Possibly 
susceptible viruses include the inamunodeficiency viruses, such as HIV and similar or related vimses 
"if in simians and other animals. In addition, other viral or microbial systems that involve the 

d"| interaction of a first inducing and second induced binding site of a single protein will also be 

^Hf5 susceptible to neutralization using bispecific fusion proteins of the current invention. The bispecific 

I fusion proteins disclosed herein can also be used in highly sensitive detection or purification of target 

protein. 

The bispecific fusion proteins of this invention may be administered to humans, or other 
C;| animals on whose cells they are effective, in various manners such as topically, orally, intravenously, 

.2<) intramuscularly, intraperitoneally, intranasally, intradermally, intrathecally, and subcutaneously. The 

h I particular mode of administration and the dosage regimen will be selected by the attending clinician, 

I rf- taking into account the particulars of the case {e.g., the subject, the disease, and the disease state 

involved, and whether the treatment is prophylactic or post-infection). Treatment may involve daily 
or multi-daily doses of bispecific fusion protein(s) over a period of a few days to months, or even 
25 years. 

If treatment is through the direct administration of cells expressing the bispecific fiision 
protein to the subject, such cells (e.g. transgenic pluripotent or hematopoietic stem cells or B cells) 
may be administered at a dose of between about 10* and 10'" cells, on one or several occasions. The 
number of cells will depend on the patient, as well as the fusion protein and cells chosen to express 
30 the protein. 

A general strategy for transferring genes into donor cells is disclosed in U.S. Patent No. 
5,529,774, which is incorporated by reference. Generally, a gene encodmg a protein having 
therapeutically desired effects is cloned into a viral expression vector, and that vector is then introduced 
into the target organism. The virus infects the cells, and produces the protem sequence in vivo, where it 
35 has its desired therapeutic effect. See, for example, Zabner et al. Cell 75:207-216, 1993. As an 

alternative to adding the sequences encoding the bispecific fiision protein or a homologous protem to the 
DNA of a vims, it is also possible to introduce such a gene into the somatic DNA of infected or 
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uninfected cells, by methods that are well known in the art (Sambrook et al. In Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor, New York, 1989). These methods can be used to introduce 
the herein disclosed fusion proteins to human cells to provide long-term resistance to HIV-1 mfection or 
AIDS. For example, gene therapy can be used to secrete the protein at mucosal surfaces, such as the 
5 vaginal, rectal, or oral mucosa. 

HIV-1 gpl20-targeted bispecific fusion proteins, for instance sCD4-SCFv( 1 7b), are 
particularly useful in the prevention of infection during or immediately after HIV exposure (e.g., 
mother/infant transmission, post-exposure prophylaxis, and as a topical inhibitor). In such instances, 
one or more doses of the bispecific fusion protein are administered before or soon after the triggering 

10 event. To prevent or ameliorate mother/infant transmission of viral infection, for instance, it may be 

beneficial to administer the gpl20-targeted bispecific fiision protein to the mother intermittently 
throughout pregnancy, and/or immediately before or following delivery, and/or directly to the 
newborn unmediately after birth. Post-exposure prophylactic treatments may be particularly 
beneficial where there has been accidental exposure (for instance, a medically related accidental 

15 exposure), including but not limited to a contaminated needle-stick or medical exposure to HlV-1 

contaminated blood or other fluid. 

The present invention also includes combinations of chimeric bispecific fusion proteins with 
one or more other agents useful in the treatment of disease, e.g. HIV disease. For example, the 
compounds of this invention may be administered, whether before or after exposure to the virus, in 

20 combination with effective doses of other anti-virals, immunomodulators, anti-infectives, and/or 

vaccines. The term "administration in combination" refers to both concurrent and sequential 
administration of the active agents. 

Examples of antiviral agents that can be used in combination with the chimeric bispecific 
fusion proteins of the invention are: AL-721 (from Ethigen of Los Angeles, CA), recombinant 

25 human interferon beta (from Triton Biosciences of Alameda, CA), Acemannan (from Carrington 

Labs of Irving, TX), gangiclovir (from Syntex of Palo alto, CA), didehydrodeoxythymidine or d4T 
(from Bristol-Myers-Squibb), EL10 (from Elan Corp. of Gainesville, GA), dideoxycytidine or ddC 
(from Hoffman-LaRoche), Novapren (from Novaferon labs. Inc. of Akron, OH), zidovudine or AZT 
(from Burroughs Wellcome), ribaririn (from Viratek of Costa Mesa, CA), alpha interferon and 

30 acyclovir (from Burroughs Wellcome), Indinavir (from Merck & Co.), 3TC (from Glaxo Wellcome), 
Ritonavir (from Abbott), Saquinavir (from Hoffmann-LaRoche), and others. 

Examples of immuno-modulators that can be used m combination with the chimeric 
bispecific fusion proteins of the invention are AS-101 (Wyeth-Ayerst Labs.), bropirimine (Upjohn), 
gamma interferon (Genentech), GM-CSF (Genetics Institute), IL-2 (Cetus or Hoffman-LaRoche), 

35 human immune globulin (Cutter Biological), IMREG (from Imreg of New Orleans, La.), 

SK&F106528, and TNF (Genentech). 
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Examples of some anti-infectives with which the chimeric bispecific fusion proteins can be 
used include cUndamycin with primaquine (from Upjohn, for the treatment of Pneumocystis 
pneumonia), fiuconazlone (from Pfizer for the treatment of cryptococcal meningitis or candidiasis), 
nystatin, pentamidine, trimethaprim-sulfamethoxazole, and many others. 
5 The combination therapies are of course not limited to the lists provided in these examples, 

but includes any composition for the treatment of HIV disease (including treatment of AIDS). 

VII. Kits 

The chimeric proteins disclosed herein can be supplied in the form of a kit for use in 
10 prevention and/or treatment of diseases {e.g., HIV infection and AIDS). In such a kit, a clinically 

effective amount of one or more of the chimeric bispecific fusion proteins is provided in one or more 
containers. The chimeric bispecific fusion proteins may be provided suspended in an aqueous 
solution or as a freeze-dried or lyophilized powder, for instance. In certain embodiments, the 
. r chimeric proteins will be provided in the form of a pharmaceutical composition. 

15 Kits according to this invention can also include instructions, usually written instructions, to 

assist the user in treating a disease {e.g., HIV infection or AIDS) with a chimeric bispecific fusion 
protein. Such instructions can optionally be provided on a computer readable medium. 

The container(s) in which the protein(s) are supplied can be any conventional container that 
is capable of holding the supplied form, for instance, microfuge tubes, ampoules, or bottles. In some 
20 applications, chimeric proteins may be provided in pre-measured single use amounts in individual, 

typically disposable, tubes or equivalent containers. 

The amount of a chimeric bispecific fusion protein supplied in the kit can be any appropriate 
amount, depending for instance on the market to which the product is directed. For instance, if the 
kit is adapted for research or clinical use, the amount of each chimeric protein provided would likely 
25 be an amount sufficient for several treatments. 

Certain kits according to this invention will also include one or more other agents useful in 
the treatment of disease, e.g. HIV disease. For example, such kits may include one or more effective 
doses of other anti-virals, immunomodulators, anti-infectives, and/or vaccines. 

30 EXAMPLE 1 

Construction of a CD4-SCFv(17b) encoding sequence 

A gpl20-taTgeted fusion protein, sCD4-SCFv( 1 7b), is constructed by linking the C-terminus 
of CD4 (D1D2, 183 amino acid residues) to the N-terminus of the heavy chain of the 17b SCFv, 
which contains the heavy chain at its N-terminus, linked via its C-terminus to the N-terminus of the 
35 light chain (see schematic diagram of the construct, Fig. 3). The 17b SCFv DNA was obtained from 
R. Wyatt and J. Sodroski, Dana Farber Cancer Instimte, Boston, MA. The 17b-MAb producing- 
hybridoma was obtained from J. Robinson, Tulane University. 
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Linkers were chosen to have sufficient length and flexibility to connect the desired protein 
segments without inducing unacceptable torsion. For the SCFv, the 15 amino acid residue sequence 
(Gly4Ser)3 (designated L2) was chosen, which has been employed successfully for production of 
SCFvs. This sequence confers excellent flexibility with minimal aggregation. The linker between 
5 the C- terminus of CD4 and the N-terminus of the SCFv (designated LI ; SEQ ID NO; 2), is seven 
repeats of the same Gly^Ser sequence. Conservative estimates indicate that this 35 amino acid 
residue linker is sufficiently long to allow CD4 and SCFv to bind simultaneously to their respective 
binding sites on gpI20. A schematic of the genetic construct is shown in Fig. 3. A unique BamH I 
restriction site has been introduced within LI to enable the production of constructs with shorter or 
10 longer linkers, and especially to provide negative controls (linkers too short, thereby not allowing 

both the CD4 and SCFv moieties of a single molecule to bind simultaneously to their respective 
binding sites on gpl20). 

'•=■;:' The starting CD4 plasmid is pCB-3, which contains the full-length CD4 cDNA (including 

y;| its natural 5' signal sequence) in the vaccinia expression plasmid pSC59 (Broder & Berger, J. Virol. 

MlS 67:913-926, 1993). This plasmid was digested with Stu I, which cuts near the end of the 2nd domain 

ji of sCD4, and with Spe 1, which cuts within the vector downstream of the CD4 insert and leaves a 5' 

overhang. 

Synthetic oligonucleotides (SEQ ID NO: 1 1) were annealed together to recapitulate the 5' 
end of the second half of the Stu I site (CCT) and the next two bases (CC) of the CD4 cDNA, and to 
M produce an Spe I overhang at the 3 ' end (this site to be destroyed upon ligation into pCB-3). The 

; ; il oligonucleotide sequence reconstmcts the remainder of the second domain of CD4 (through ser,83), 

hj and encodes the 37 amino acid intermediate linker (gly4ser)5gly4thT2ser, followed directly by the 

universal translational termination sequence (UTS) (SEQ ID NO: 6). A BamH I site has been 
deliberately included within the linker near the end of the third (gly^ser) repeat, to enable subsequent 
25 linkage to the 17b SCFv with the exact LI sequence, and to enable modification of linker length. 

The resulting intermediate plasmid is designated pCDl. This constmct was confirmed by DNA 
sequence analysis using standard techniques. To facilitate subsequent procedures, the sCD4-linker 
sequence was recloned into a pSC59 derivative lacking a BamH I site, forming intermediate plasmid 
pCD2. 

30 The starting 17b plasmid containing the 1 7b SCFv cDNA in a plasmid vector (pmt del 0) 

was donated by Dr. Richard Wyatt (Dana Farber Cancer Institute, Boston, MA). The SCFv cDNA is 
constructed with the heavy chain at the 5' segment and light cham at the 3' segment, attached via 
DNA encoding the L2 linker (gly4ser)3. The 17b SCFv construct has a TPA signal sequence at the 5' 
end, and sequences corresponding to a thrombin cleavage site and a hexa-his tag (to facilitate 

35 purification) at the 3' end, followed by a stop codon. A comparable constmct without the thrombin 

cleavage site and hexa-his tag can also be produced. 
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PCR teclmology was used to attach the 1 7b SCFv sequence to the CD4-linker sequence m 
pCD2. Suitable primers are represented in SEQ ID NOs: 7 and 9. The forward (5') primer (SEQ ID 
NO: 7) contains a BamH I site near the 5' end {preceded by an overhang), followed by nucleotides 
that reconstruct the third (gly4ser) plus four additional (gly^ser) repeats; this is followed by 
5 nucleotides exactly corresponding to the start of the 17b heavy chain (excluding the 5' signal 

sequence, beginning at CAG GTG ). The 3 ' primer (SEQ ID NO: 9) begins with convenient 
restriction sites for cloning into pCD2 (Spe I and others), followed by nucleotides exactly 
complementary to the 3' end of the 17b SCFv sequence in pmt del 0 (stop codon, hexa-his tag, and 
thrombin cleavage site). 

10 These primers are used to prime the plasmid vector containing the 1 7b SCFv sequence in 

pmt del 0, and the resultant PGR product digested with BamH I plus a restriction enzyme that cleaves 
at the opposite 3' end {e.g., Spe I). This digested fragment is then force-cloned into pCD2 that has 

fe;f been digested with the same enzymes (BamH I and Spe I). The resulting plasmid (designated herein 

as pCD3) contains the final sCD4-SCFv(17b) construct (with the thrombin cleavage site and hexa-his 

Ills tag) downstream from the strong, synthetic early/late vaccinia promoter in pSC59. There are 

!' J convenient, unique restriction sites on each side of the sCD4-SCFv sequence for possible future 

f 1 cloning steps. 

' The 17b SCFv cDNA (including the 5' signal sequence) also has been excised from the pmt 

t='% del 0 vector by restriction enzyme digestion or PGR, and cloned into the vaccinia expression plasmid 

Wo pSC59 to provide a control construct. 

1:3 EXAMPLE 2 

Expression and Purification of CD4-SCFv(17b) fusion protein 
A. Expression 

25 For small amounts of protein expression, vaccinia expression technology can be used to 

produce the sCD4-SCFv(17b) (as well as the control 17b SCFv protein). The plasmid containing the 
construct in the vaccinia expression plasmid pSC59 is used to produce a vaccinia recombinant, using 
standard technology. For such expression, suitable cells (HeLa, BSC-1, etc.) are infected with the 
recombinant vaccinia vims; after incubation for 24-36 hours at 37° C, the recombinant protein is 

30 present in the culmre supernatant. Initial biochemical and functional studies can be done with 

unfractionated supernatant; where necessary, the sCD4-SCFv protein may be purified (see below). 
Small scale, initial experiments can be performed with small amounts of material (5-20 micrograms, 
obtained from 1-5X10' cells). The preparation can be scaled up; for such large-scale production, it 
may be advantageous to employ higher yield technologies for expression of the recombinant proteins 

35 (e.^. , baculovirus, yeast, or E. coli). 

Expression of the pCDl secreted protein product (the first two domains of CD4 through 
ser|83, plus the 37 amino acid linker) was analyzed. BSC-1 cells were transfected with pCDI and 
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infected with wild type vaccinia virus, then incubated overnight at 37° C. Supematants were 
analyzed by Western (immnunoblot) analysis, using antibodies against CD4. As expected, the 
protein encoded by pCDl migrated slightly more slowly than standard purified two-domain sCD4 
(Upjohn- Pharmacia, Kalamazoo, MI). 
5 The pCD3 full-length sCD4-SCF v( 1 7b) (sCD4- 1 7b) fusion protein has been expressed and 

tested similarly, and 17b SCFv domain (as cloned into pSC59) can be examined likewise. The 
sCD4-17b fusion protein (at least a portion of which is secreted) has the expected molecular size 
(approximately 55 kD) when analyzed by SDS PAGE and Western blotting. The protein reacted 
strongly with antibodies against CD4 or the hexa-his tag, confirming the presence of these N- 
1 0 terminal and C-terminal moieties, as well as the correct reading frame. 

B. Purification 

Expressed fusion protein as constracted above with an amino-terminal hexa-his tag was 
purified using this molecular tag. The tag enables the specific binding and purification of the fusion 
protein by binding to nickel-nitrilotriacetic acid (Ni-NTA) metal affinity chromatography matrix 
15 (see, for instance, The QIAexpressionist, QIAGEN, 1997). A hexa-his tag was used in the present 

examples. 

Alternative purification methods include a combination of HPLC and conventional Hquid 
column chromatography (gel filtration; ion exchanger; isoelectric focusing). 

C. Primary Characterization 

20 In order to test gpl20 binding to the 17b domain of the sCD4-17b fusion protein, 96-weil 

ELISA plates were first coated with the 13B8.2 anti-CD4 MAb (Beckman Coulter, Chaska, MN, 
Catalogue no. IM0398), whose epitope on CD4 overlaps determinants involved m binding to gpl20. 
The plates were then incubated with either the purified sCD4-l 7b or control buffer. When the 
chimeric protein was captured this way, the 17b moiety remained available to bind gpl20 complexed 

25 to sCD4; however the sCD4 moiety could not bind free gpl20, since it was captured on the plate by 

the anti-CD4 MAb that blocks the binding site. The plates were incubated with gpl20 (IIIB isolate, 
Ratner et al. Nature 313:277-284, 1985) complexed to sCD4. Binding of gpl20 was detected by a 
polyclonal anti-gpl20 antiserum, followed by anti-rabbit IgG conjugated to horseradish peroxidase. 
The plates were washed and incubated with ABTS substrate, and the oxidized product was 

30 quantitated by measuring absorbance at 405 nm. The results indicated specific binding: absorbance 

values were 0. 15 with the sCD4-17b chimeric protein, compared to 0.05 with the control buffer. 

For testing functionality of the sCD4 region of the chimeric protein, the ELISA plates were 
first coated with an anti-His tag MAb (QIAGEN Inc., Valencia, CA, Catalog no. 34670), then 
incubated with either the purified chimeric protein or control buffer. With the chimeric protein 

35 captured in this way, the sCD4 moiety was available to bind free gpl20; however the 17b moiety 

could not bind gpl20 that was not complexed to sCD4. The plates were incubated with free gpI20, 
and binding was detected as detailed above. The results indicated specific binding: absorbance 
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values were 0.46 with sCD4-I7b, compared to 0.05 with the control buffer. Thus, the ELISA assays 
confirmed the expected functional binding properties for each moiety of tlie chimeric protein; 17b 
moiety bound the gpl20/sCD4 complex, and the CD4 moiety bound free gpl20. 

EXAMPLE 3 
HIV-envelope neutralization measurements 
A. Vaccinia-based reporter gene cell fusion 

Env-mediated cell fusion activated by CD4 was measured using the vaccinia-based reporter 
gene assay {Nussbaum et al, J. Virol. 68:541 1-5422, 1994). For the experiment shown in Table 2 
and Fig. 1, effector cells (HeLa) were transfected with plasmid pGlNT7- C-gal (lacZ linked to T7 
promoter), then infected with vaccinia recombinants encoding either the mutant uncleaved Unc Env 
or the wildtype (WT) SF162 Env (Broder & Berger, Proc. Natl. Acad. Sci., USA 92:9004-9008, 

1995) . Target cells were created by transfecting NIH 3T3 cells with plasmid pGA9-CKR5, 
containing the CCR5 cDNA linked to a vaccinia promoter (Alkhatib et al. Science 272:1955-1958, 

1996) , then infecting these cells with wild type vaccinia virus WR. Target cells also carry and 
express a bacteriophage T7 RNA polymerase. Prior to fusion assays, transfected cells were 
mcubated overnight at 3 1° C to allow expression of recombinant proteins, then washed. 

For each fusion assay, mixtures of effector and target cells (1x10^ of each cell type per 
well, duplicate wells) were prepared in the absence or presence of sCD4 (100 nM final). After 2.5 
hours at 37° C, cells were lysed with NP-40 and B-gal activity was quantitated using standard 
procedures (Table 2 and Fig. 1). Relative fiision (specific 6 -gal activity) was determined from the 
mean of duplicate samples, and calculated as WT-Unc. 

Table 2 

Vaccinia-based reporter gene cell fusion assay using soluble CD4 

Total J3-gal Relative fusion 

(Raw data) (WT-Unc) 
Unc Env (Control) WT Env (SF 162) 
duplicates mean duplicates mean 

-sCD4 0.50 0.40 0.45 0.50 0.40 0.45 0.0 

+ sCD4 0.40 0.50 0.45 6.60 5.20 5.90 5.45 



This vaccinia-based fusion assay can be used to assess the neutralizing ability of the herein 
disclosed bispecific fusion proteins. The neutralizing ability of MAb 17b was demonstrated to be 
dependent on the addition of soluble CD4 as follows (see Table 3 and Fig. 2). Effector cells were 
created by co-infecting HeLa cells with a vaccinia recombinant encoding HIV- 1 Env (LAV) (Broder 
& Berger, Proc. Nail. Acad. Sci., USA 92:9004-9008, 1995), and another encoding T7 RNA 
polymerase. Target cells were created by co-transfecting NIH 3T3 cells with plasmids pYFl-fusin 
(Feng et a!.. Science 272:872-877, 1996) encoding CXCR4, and pGlNT7-8-gal (lacZ linked to the T7 
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promofer). The target cells were then infected with vaccinia viruses vCB-3 (encoding CD4, standard 
assay) (Broder et al., Virology 193:483-491, 1993), or WR (wild type virus, sCD4 assay). As 
background controls, target cells were transfected with pGlNT7- B-gal only {i.e., no coreceptor). 
Transfected cells were incubated overnight at 3 1 ° C to allow expression of recombinant proteins, 
then washed. Effector cells were incubated 30 minutes at 37° C with the indicated concentration of 
MAb 17b (Table 3). 

For fusion assays, mixtures were prepared between effector and indicated target cells (2 x 
10^ of each cell type per well, duplicate wells); in the standard assay, target cells expressed CXCR4 
and CD4, and no soluble CD4 added; in the sCD4 assay, target cells expressed CXCR4 alone, and 
soluble CD4 was added (200 nM final). After 2.5 hours at 37° C, cells were lysed and B -gal activity 
was quantitated. Background control 6 -gal values (standard assay, 0.6; sCD4 assay, 0.2), obtained 
with target cells lacking coreceptor, were subtracted to give the data presented in Table 3. Data 
represent percentage of control (no MAb) for each assay. 

Table 3 

MAb-tnediated inhibition of fusion assay 



[17b] 


Standard Assay 


sCD4A 


.ssay 


(|ig/ml) 


6-gal 


% control 


B-gal 


% control 


none 


42.3 


100.0 


11.89 


100.0 


0.1 


39.5 


93.4 


13.55 


113.9 


0.5 


43.9 


103.8 


4.66 


39.2 


1 


39.8 


94.1 


1.68 


14.1 


5 


50.5 


119.4 


0 


0 



The effectiveness of the herein described bispecific fusion proteins for neutralizing fusion is 
tested in a similar maimer, by adding varying amounts of the bispecific fusion protein, e.g. sCD4- 
SCFv(17b), to the above assay. Exogenous sCD4 and SCFv(17b) or other gpl20-binding proteins 
need not be added, though they can be used as controls as above, or to determine relative inhibitory 
efficiencies compared to the bispecific fusion protein. Using this assay, the effects of media from 
control cells infected with wild-type vaccinia virus WR, were compared with media from cells 
infected with the recombinant vaccinia virus encoding sCD4-l 7b. The relative specific p- 
galactosidase values were 23.4 with the control media and <] with sCD4-SCFv media. Thus, the 
chimeric sCD4-17b protein strongly inhibited HIV-1 Env-mediated cell fusion. 

EXAMPLE 4 
Large scale production and analysis of sCD4-17b 

To produce large amounts of the sCD4-17b protein, the DNA construct has been transferred 
to the pETl lb plasmid vector (Novagen, Madison, WI, Catalog no. 69437-3), which is suitable for 
high level inducible expression in E. coli. This system involves cloning of target genes under control 
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of strong bacteriophage T7 transcription signal. Once established in a non-expression host bacterial 
cell, plasmids are then transferred into expression hosts containing a chromosomal copy of the T7 
KNA polymerase gene under lacUVS control, and expression of the recombinant protein of interest 
(here, sCD4-17b) is induced by the addition of IPTG. The expressed protein is produced at a very 
high level, and may constitute more than 50% of the total cell protein in the induced culture within a 
few hours after induction. Western blot results indicate high level expression of the sCD4-I7b from 
thepETllbplasmid. 

The protein produced can be denatured and renatured from inclusion bodies to provide a 
large quantity of functional sCD4-17b protein. This protein can be used for in vitro studies to test 
inhibition in assays of both Env-mediated cell fusion and HIV infection (p24 production). 

In addition, the sCD4-17b protein can be used for in vivo studies. One in vivo model 
involves SCID mice reconstituted with human thymus plus liver (Pettoello-Montovani et al, 
J.Infect.Dis. 177:337-346, 1998); this system will be used to test whether sCD4-17b inhibits (and to 
what extent), or prevents, acute HIV-1 infection. This system has been successfully used to 
demonstrate potent blocking activities of other anti-HIV agents {e.g., protease inhibitors and reverse 
transcriptase inhibitors, and Env-targeted toxins) (Pettoello-Montovani et al, J.Infect.Dis. 177:337- 
346, 1998). 

A second example of an in vivo system for testing sCD4-17b activity involves rhesus 
macaques challenged with SHIV viruses (recombinant viruses containing SIV gag and pol, plus an 
HIV envelope; Li et al.,J. Virol. 69:7061-7071, 1995). This system will be used to test whether the 
sCD4-17b protein inhibits (and to what extent), or prevents, acute SHIV infection. 

The effects of sCD4-17b against chronic infection will also be examined, again using the 
SCID-hu/HIV-1 mouse system and the macaque/SHIV system. 

Both in vitro and in vivo study systems also will be used to test the potency of sCD4-17b 
protein when used in combination with other anti-HIV agents {e.g., RT and protease inhibitors or 
other HIV-1 neutralizing MAbs). 

The foregoing examples are provided by way of illustration only. Numerous variations on 
the biological molecules and methods described above may be employed to make and use bispecific 
fusion molecules capable of binding to two sites on a single protein, and especially two sites on the 
HIV envelope protein gpl20, and for their use in detection, treatment, and prevention of HIV 
infection. We claim all such subject matter that falls within the scope and spirit of the following 
claims. 
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We claim: 

1 . A neatralizing bispecific fusion piotein enable of bmdmg to two sites on a target 
pKrtcm, corqpiising a first binding domain capable of binding to an inducing site on the taiget piotdn. 
thereby exposing an induced epitope; a second binding domain capable of forming a neutralizing 
complex -widi an induced epitcp e of flie laigct protein; and a linker connecting the first domain to ibe 
second do main, vAerein at least one of the jSrst and second binding domains congjrises a binding 
portion of a variable region of an antibody heavy or light chain. 

2. A protein according to claim 1, wi^xein die first binding domain conqjrises a 
binding portion of a variable region of an antibody heavy or light chain. 

3. A protein sccocding to claim 2, wherein die first binding domain comprises an 
^lopc binding domain of an antibody. 

4. A protein accoiding TO claim 3, vdicreinfte fust domain comprises a single-chain 
Fv(SCFv). 

5- A protem according to claim 3, -wherein &e antibody binding domain mimics a 
biological activity of a CD4 molecule in binding to &e inducing site and esqKJsing the inducing 
epitope. 

6. A piotein according to claim S, -wherein the antibody binding domain is derived 
from a CD4 anti-idioiypic antibody. 

7. A protein according to claim 1, whcrdh fte target protein is a viral envelope 
protein of a vims. 

8. A protein according to claim 7, wherein fee virus is human immunodeficiency virus 

(HIV). 

9. A protein according to claim 8, wherein the viral envelope protem is gpl20. 

10. A protein according to claim 2, wherein die first binding domain is derived from a 
CD4 molecule. 

11. A protein according to claim 10, -wherein fee first binding r^nm^iT, coniprises CD4 
Dl orCIMDlD2. 

12. A protein according to claim 11, •wherein fee first binding doinain is sCD4. 

13. A protein according to claim 1, wherein fee second binding domain conqjrises a 
binding portion of a variable region of an antibody heavy or light chain. 

14. A protein according to clauu 13, wherein fee second binding domain comprises a 
binding domain of an antibody. 

15. A protein according to claim 14, wherein fee second binding domain comprises a 
single-chain Fv (SCFv). 

16. A protein according to claim 15, wherein fee SCFv is selected team the groixp 
consisting of SCFvlTb, SCFv48d and SCFvCGlO. 

1 7. A protein accordiog to claim 14, wherein tiie antibody binding i^nrT^a-fn is derived 
fiom a neutralizing monoclonal antibody. 
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1 8. A protein according to claim 17, whorein fte neunalizing inDnoclond^:ja>ody is 
selected firom the group consisting of 17b, 48d, and CGIO. 

1 9. A protein according to claim 9, wherein die second binding domain mimics a 
biological activity of an HIV corecepior molecule in binding to gpl20. 

20. A protein according to claim 19, wbercin the second binding domain comprises a 
peptide fiagmcnt of an HIV coicceptor. 

21. A protein according to claim 20, wherein the HIV weceptoi is a diemokine 

receptor. 

22. A protein acc<»ding to claim 21, wbeiein the chemoVine receptor is selected fiom 
the groi^j consisting of CXC314, CCR2B, CCS3. and CCR5, CCRE, CCR9, CXjCRl, US28, or the 
chemokine receptoridated proteins including STRL33, GPR15, API, ChemR23, and BLTR. 

23 . A protein according to claim 9, wherein the induced qpitqpe comprises at least one 
coreceptorbindiing determinant of gpl20. 

24. A protein according to claim 9, wherein the inducing site is a gpl 20 CD4 binding 

site. 

25. A protein according to claim 14, wherein the binding domain of the antibody is 
capable of binding to at least one corecepior tiding determinant of gpl20. 

26. A protein according to claim 1, wherein ih.9 linVwr maintains Ihe second binding 
domain in binding proximity to fhe induced epitope when the first binding domain is bound to die 
inducing site. 

27. A protein according to claim 26, wherein the linker is substantially flexible. 

28. A protein according to claim 26. wherein the linker is 15-100 ai^stroms (A) long. 

29. A protdn according to claim 26. wherein Qie linker is 1 Q-IOD amino acid residues 
in length. 

30. A protein according to claim 26, wherein the linker coii5>rise5 at least one 
occurrence of an amino acid sequence as represented by SEQ ID NO: 1. 

31. A protein according to claim 1, wherein fhe linker comprises at least one 
occurrence of an amino add sequence represented by SEQ ID NO: 1 . 

32. A protein according to claim 3 1, wherein the linker conqjrises an amiTif. acid 
sequence represented by SEQ ID NO: 2. 

33. A functional recombinant bispedfic fusion protein capable of binding to two sites 
on gpl20, conqirising: 

a) 5CD4; 

b) SCFv(17b),- and 

c) a linker of a Icngfli sufBcient to maintain the SaFv(l7b) in binding 
proximity an SCFv(17b) epitope when sC3>4 is bound to gpl20. 

34. A protein according to claim 33, wherein die Knker has an '^^r^r. acid sequence as 
represented by SEQ ID NO: 2 

35. An isolated nucleic acid molecule encoding a protein according to clakn 34. 
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36. A nucleic acid molecule according to claim 35, wherdn the nucleic add sequence 
is tepresenied by SEQ ID NO: 3. 

37. A protein encoded for by Ihe micleic acid molecule according' to claim 36. 
3S. An isolated xmcldc acid molecule encoding a protein according to claim 1. 

39. The nucleic acid molecule according to claim 38, having nucleic acid sequence 
SEQ ID NO: 4. 

40. A transgenic eukaiyotic cell comprising &e isolated nucleic acid molecule 
according to claim 38. 

41. Aine&odforFnidacingiQaeukaxyoticceaafiucdoiialrccDnibiz^ 
fusion piotein capable of binding two sites on a target ^tein, conq>iising Hie steps of: 

a) transfecting the eukaiyotic cell with a recombinant nucleic acid molecule 
according to claim 38; 

b) culttning the transfected eukar^tic cell under coodidons that cause 
production of the protein; and 

c) recovering die protein produced by Ihe cultured eukaiyotic ceU. 

42. The method of claim41, wherein ^ eulraiyotic cell is a mammalian cell. 

43. Themethodof claim 41, wherein recoveiing the protein comprises: 

a) identifying the protein by the presence of a molecular tag; and 

b) separating the protein having the naoleculax tag so identified team 
molecules without the tag, so as to recover its protein produced by the cultured eukaryotic ceU. 

44. A method for inactrvating a gpl20 protein, comprisii^ Hoc step of: 
contacting the gpl20 protein with a protein according to claim 9, or a variant 

protein, analog or mimetic dieieof. 

45. A method for neutralizing a human immunodeficiency virus, conqirising the st^ 

of: 

contacting the human immunodeficiency viitis -with a protein according to claim 9, 
or a variant protein, analog or mimetic thereof. 

46. A method for blocking and preventing the binding of a viral or recombinani gpI20 
protein to sohible CD4 or lymphocyte C3X, compxising the step of: 

contacting the gpl20 protein with a protein according to claim 9, or a variant 
protein, analog or mimetic thereof. 

47. A method for inhibiting HTV vims replication or infectivity in a subject, coxr^nising 
administering to the subject an amount of the protein accoiding to claim 9, or a variant protein, 
analog or mimetic diereof. suiScient to inhibit HIV virus replication or infectivity. 

48. A coQ^osition comprising the protein according to claim 1, or a variant protein, 
aoalog or mimetic thereof. 

49. A pharmaceutical composition comprising the protein according to claim 1, or a 
variant protein, analog or mimetic thereof, and a phaxmaceutica]ly acceptable carrier. 
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50. The method according lo claim 47, wherein die protein is administered in the form 
of a phannacculical composition. 

51. A protein analog, derivative, or rmmetic of Hie protein of claim 1 . 

52. A kit for treatment and/or prevention of HTV infection, 

a dinically efifeciive dose of die neutralizing bispedfic iiision protein of claim 1. 

53. The kit of claim 52, fiiither conqHisiag instructions. 

54. The kit of claim 53, wherein the instructions include directions for administering at 
least one dose of the neutcaliang bispedfic fiiaon protein to a subject in need of such treatment 

55. The kit of claim 52, wherein the neutralizing bispscific fusion protein is provided in 
the fonn of a pharmaceutical conipositioii. 
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statements made on infoimation and belief are believed to be true; and further that these statements were made 
with the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or 
both, under Section 1001 of Title IS of the United States Code and that such willful false statements may 
jeopardize the validity of the application or any patent issued thereon. 



\ Edward A. Berger 




Full Name of First Joint Inventor: 
Inventor's Signature 
Residence; Rockvilie , Maryland 
Citizenship: United Slates of America 

Post Office Address: 5820 Inman Park Circle, Rockville, Maryland 20852 
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COMBINED DECLARATION AND POWER OF ATTORNEY 
FOR PATENT APPLICATION 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am an original, first and joint inventor of die subject matter which is claimed and for 
which a patent is sought on tlie invention emifled A NOVEL CHIMERIC PROTEIN FOR PREVENTION AND 
TREATMENT OF HIV INFECTION, the specification of which 

13 is attached hereto. 

□ was filed on as United States Application No, _^ , 

1^ was filed on 16 March 2000 as International Application No. PCT/U$00/06946 . 

Q and was amended on (if applicable). 

□ witlt amendments through (if applicable). 

I hereby state that I have reviewed and understand the contents of the above-identified specification, 
including the claims, as amended by any amendment referred to above. 

1 acknowledge the duty to disclose information which is material to patentability as defined in Title 
37, Code of Federal Regulations, § 1 -56. If this is a continuation-in-part application filed under the conditions 
specified in 35 U.SC § 120 which discloses and claims subject matter in addition to that disclosed in the prior 
copending application, 1 further acknowledge the duty to disclose material information as defined in 37 CFR 
§ ) .56 which occurred between the filing date of the prior application and the national or PCT international 
filing date of the continuation-in-part application. 

I hereby claim foreign priority benefits under Title 35, United States Code, § 1 19(a)-(d) of any 
foreign applicatlon(s) for patent or inventor's certificate or of an PCT IntemationaJ appiication(s) designating at 
least one country other than the United States of America listed below and have also identified below any 
foreign application(s) for patent or inventor's certificate or any PCT International application(s) designating at 
least one countiy other than the United States of America filed by me on the same subject matter having a filing 
date before that of the application(s) on which priority is claimed: 



Prior Foreign Application(s) 




Priority Claimed 




1 


□ Yes nNo 



I hereby claim the benefit under Title 35, United States Code, § 1 1 9(e) of any United States 
provisional application(s) listed below: 



60/124,681 


March 16, 1999 


(Application No.) 


(Filing Date) 



I hereby claim the benefit under Title 35, United States Code, § 120 of any United States 
appUcation(s) or § 365(c) of any PCT International application{s) designating the United States, listed below 
and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior United 
States or PCT Iniemational application in the manner provided by the first paragraph of Title 35, United States 
Code, § 1 12, 1 acknowledge the duty to disclose material information as defmed in Title 37, Code of Federal 
Regulations, § 1 .56(a) which occurred between the filing date of the prior application and the national or PCT 
Internationa! filing date of this application: 
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Full Name of Second Joint Inventor: Christie M. Del CastiUo— 

Inventor's Signature ('"^^^ ^y.^^ ^^^ ^^^'^r^^^ e/z^loi 

Residence: San Francisco, California 
Citizenship: United States of America 

Post Office Address: ^2264fegirSffeet, San Francisco, California 94 122 
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SEQUENCE LISTING 



PCT/USOO/06946 



<110> Berger, et al. 

<120> NOVEL CHIMERIC PROTEIN FOR PREVENTION AND TREATMENT OF 
HIV INFECTION 

<130> 4239-54106 Berger PCT 



<150> 60/124,681 
<151> 1999-03-16 

<160> 11 

<170> PatencIn Ver. 2.1 

<210> 1 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker peptide 
<400> 1 

Gly Giy Gly Gly Ser 
1 5 



<210> 2 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: seven repeat 
polypeptide linker 



<400> 2 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
15 10 15 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 



Gly Gly Ser 
35 



<210> 3 
<211> 508 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CD4-scFv ( 17b) 



<400> 3 



wo 00/55207 PCT/USOO/06946 

2 

Met Asn Arg Gly Val Pro Phe Arg riis Leu Leu Leu Val Leu Gin Leu 
15 10 15 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Lys Lys Val Val Leu Gly Lys 
20 25 30 " " 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 
35 4 0 4 5 

lie Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys He Leu Gly Asn 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 
65 70 75 ^ 80 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu He He 
85 90 95 

Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He Cys Glu Val Glu 
100 105 110 

Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 
115 120 125 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 
145 150 155 160 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 
165 170 175 

Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 
180 185 190 

Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 
195 200 205 

Gly Gly Gly Gly Ser Gly Gly Glv Gly Ser Glv Gly Gly Gly Ser Glv 
210 215 220 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
225 230 235 240 

Gly Gly Ser Gin Val Gin Leu Leu Glu Ser Gly Ala Glu Val Lys Lys 
245 250 255 

Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala Ser Gly Asp Thr Phe 
260 265 270 

He Arg Tyr Ser Phe Thr Trp Val Arg Gin Ala Pro Gly Gin Gly Leu 
275 280 285 

Glu Trp Met Gly Arg He He Thr He Leu Asp Val Ala His Tyr Ala 
290 295 300 

Pro His Leu Gin Gly Arg Val Thr He Thr Ala Asp Lys Ser Thr Ser 
305 310 315 320 

Thr Val Tyr Leu Glu Leu Arg Asn Leu Arg Ser Asp Asp Thr Ala Val 
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325 



3 

330 



PCT/USOO/06946 

335 



Tyr 
Asp 



Ala Gly 
340 



Val Tyr Glu Gly Glu Ala Asp Glu Gly Glu Tyr 



Asn Asn 
355 



Thr Ser 
370 



Gly Phe 
Gly Gly 



Leu Lys His Trp Gly Gin Gly 
360 



Thr Leu 
365 



lly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
375 380 



Gly Ser Glu Leu Glu 
385 



Ser Pro Ala 
395 



Thr Leu Se; 



Val Ser 
400 



i Gly Glu 
■ Asp Leu 



lie Tyr 
435 



Arg Ala 
405 



Thr Leu Ser 



Cys Arg Ala 
410 



Ser Glu Se. 



Val Ser 
415 



Ala Trp 
420 



Gly Ala Ser Thr 



Lys Pro Gly i 
425 

Ala Thr Gly ' 



1 Ala Pro 
430 



Pro Ala 
445 



Arg Leu 
Arg Phe 



Gly Ser Gly Ala Glu Phe Thr Leu 
455 



He Ser Ser Leu 



Gin 
4 65 



Ser Glu Asp Phe Ala Val Tyr 



Tyr Cys Gin Gin Tyr Asn Asn Trp 
475 480 



Pro Arg 
Pro Arg 



Tyr Thr 
485 



Gly Se: 
500 



470 

Phe Gly Gin Gly Thr Arg Leu Glu He 



His His Hx: 
505 



<210> 4 
<211> 1440 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CD4-scFv ( 17b) 
<400> 4 

atgaaccggg gagtcccttt taggcacttg cttctggtgc tgcaactggc gctcctccca 60 

gcagccactc agggaaagaa agtggtgctg ggcaaaaaag gggatacagt ggaactgacc 120 

tgtacagctt cccagaagaa gagcatacaa ttccactgga aaaactccaa ccagataaag 180 

attctgggaa atcagggctc cttcttaact aaaggtccat ccaagctgaa tgatcgcgct 24 0 

gactcaagaa gaagcctttg ggaccaagga aacttccccc tgatcatcaa gaatcttaag 300 

atagaagact cagatactta catctgtgaa gtggaggacc agaaggagga ggtgcaattg 360 

ctagtgttcg gattgactgc caactctgac acccacctgc ttcaggggca gagcctgacc 420 

ctgaccttgg agagcccccc tggtagtagc ccctcagtgc aatgtaggag tccaaggggt 480 

aaaaacatac agggggggaa gaccctctcc gtgtctcagc tggagctcca ggatagtggc 54 0 

acctggacat gcactgtctt gcagaaccag aagaaggtgg agttcaaaat agacatcgtg 600 

gtgctagctt tccagaaggc ctccggaggt ggcggtagtg ggggaggcgg ttcaggcgga 660 

ggtggatccg gtggcggagg gtcgggcggg ggtggaagcg ggggtggcgg ctccggaggc 720 

ggaggttcac aggtgcagct gctcgagtct ggggctgagg tgaagaagcc tgggtcctcg 780 

gtgaaggtct cctgcaaggc ctctggagac accttcatca gatatagttt tacctgggtg 840 

cgacaggccc ctggacaagg ccttgagrgg atgggaagga tcatcactat ccttgatgta 900 

gcacactacg caccgcacct ccagggcaga gtcacgatta ccgcggacaa gtccacgagc 960 
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4 

acagtctacc tggagctgcg gaarctaaga tctgacgata cggccgtata rttctgtgcg 1020 

ggagtgtacg agggagaggc ggacgagggg gaatatgata araatgggtt tctgaaacat 1080 

tggggccagg gaaccctggt cacggtcacc tcaggtggcg gtggctccgg aggtggtggg 1140 

agcggtggcg gcggatctga actcgagttg acgcagtctc cagccaccct gtctgtgtct 1200 

ccaggggaaa gagccacccr ctcctgcagg gccagtgaga gtgttagtag cgacctagcc 1260 

tggtaccagc agaaacctgg ccaggctccc aggctcctca tatatggtgc atccaccagg 1320 

gccaccggtg tcccagccag gtrcagtggc agtgggtctg gggcagaatt cactctcacc 1380 

atcagcagcc tgcagrctga agattttgca g^ttattact grcagcag-ca caataactgg 1440 



<210> 5 
<211> 127 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
oligonucleotide 

<400> 5 

cctccggagg tggcggtagt gggggaggcg gttcaggcgg aggtggatcc ggaggcggag 60 

ggtcgggcgg gggtggaagc gggggtggcg gctctggtgq cggaggtacc actagttaag 120 
tgagtag 127 



<210> 6 
<211> 39 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
encoded by SEQ ID NO: 5 

<400> 6 

Ala Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 
15 10 15 

Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
20 25 30 

Gly Gly Gly Gly Thr Thr Ser 
35 



<210> 7 
<211> 103 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 7 

ttttatggat ccggtggcgg agggtcgggc gggggtggaa gcgggggtgg cggctccgga 60 
ggcggaggtt cacaggtgca gctgctcgag tctggggctg agg 103 



<210> 8 
<211> 32 
<212> PRT 
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<213> Artificial Sequence 



5 
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<220> 

<223> Description of Artificial Sequence: peptide 
encoded for by SEQ ID NO: 7 

<400> 8 

Gly Ser Gly Gly Gly Gly Ser Giy Gly Gly Gly Ser Gly Gly Gly Gly 
15 10 15 

Ser Gly Gly Gly Gly Ser Gin Val Gin Leu Leu Glu Ser Gly Ala Glu 
20 25 30 



<210> 9 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 9 

taatttatcg atcacgtgac tagtcctagg cccgggtcaa tgatgatgat gatgatggcc 60 
gctgc _ 



<210> 10 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
encoded for by SEQ ID NO: 9 

<400> 10 

Ser Gly His His His His His His 
1 5 



<210> 11 
<211> 131 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse 
oligonucleotide 

<400> 11 

ctagctactc acttaactag tggtacctcc gccacctgag ccgccacccc cgcttccacc 60 
ccccgcccga ccctccgcct ccggatccac ccccgcctga accgcctccc cactaccgcc 120 
acctccggag g 



